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Feedback  System  Design: 

The  Single-Variate  Case  —  Part  I* 

R.  SaeksK  J.  Murray  >,  O.  Chua \  C.  Karmokolias \ 
and  A.  Iyer* 


Abstract.  A  recently  developed  algebraic  approach  to  the  feedback  tyncm  design 
problem  is  reviewed  via  the  derivation  of  the  theory  in  the  single* variate  ease.  This 
allows  the  simple  algebraic  nature  of  the  theory  to  be  brought  to  the  fore  while 
simultaneously  minimising  the  complexities  of  the  presentation.  Rather  than  simply 
giving  a  single  solution  to  the  prescribed  design  problem  we  endeavor  to  give  a  com¬ 
plete  parameterization  of  the  set  of  compensators  which  meet  specifications. 
Although  this  might  at  first  seem  to  complicate  our  theory  h,  in  fact,  opens  the  way 
for  a  sequential  approach  to  the  design  /problem  in  which  one  parameterizes  the 
subset  of  those  compensators  which  meet  the  second  specification.,  .etc;  Specific 
problems  investigated  include  feedback  system  stabilization,  the  tracking  and  distur¬ 
bance  rejection  problem,  robust  design,  transfer  function  design,  poke  placement, 
simultaneous  stabilization,  and  stable  stabilization. 

1.  Introduction 

In  1976  Youla,  Bongiorno,  and  Jnbr  published  two,  now  classical,  papers 
[23,24]  in  which  a  complete  parameterization  of  the  set  of  stabilizing  com¬ 
pensators  for  a  multivariate  feedback  system  was  obtained.  In  the  ensuing 
years  this  work,  which  is  often  termed  the  YBJ  theory,  has  led  to  the 
development  of  an  entirely  new  approach  to  the  feedback  system  design 
problem.  Indeed,  their  stabilization  theory  has  been  extended  to  include: 

(i)  the  tracking  and  disturbance  rejection  problem 

(ii)  robust  design  algorithms 

(iii)  design  with  a  proper  or  stable  compensator 

(iv)  transfer  function  design 
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(v)  pole  placement 

(vi)  simultaneous  stabilization 

Moreover,  much  of  the  work  has  been  extended  to  the  case  of  general  linear 
systems;  distributed,  time- varying,  multidimensional,  etc.;  by  formulating 
it  in  an  abstract  ring  theortic  (8,10)  or  altebro-geomctrie  setting  (17). Un¬ 
fortunately,  these  generalizations  have  been  achieved  at  the  cost  of  increas¬ 
ing  the  Qomptoriry  of  the  theory  and,  as  such,  the  simple  algebraic  character 
of  the  work  has  been  obscured. 

The  purpose  of  the  present  paper  is  to  survey  this  literature  in  such  a  way 
as  to  illustrate  the  simplicity  of  the  theory.  To  this  end  the  presentation  is 
restricted  to  the  single- variate  case  wherein  a  simple  algebraic  theory  is 
possible.  Indeed,  by  so  doing  we  are  able  to  give  simple  single-variate 
algebraic  derivations  for  several  results  whose  true  character  has  hitherto 
bees  obscured  by  the  abstract  ring  theoretic  or  multivariable  theory. 

The  key  to  our  theory  is  a  three  step  design  philosophy 

(i)  stabilization 

(H)  achievement  of  design  constraints 

(iii)  optimization  of  system  performance 
First  and  foremost,  a  feedback  system  must  be  stabile  and,  as  such,  the  first 
step  in  the  design  process  is  the  parameterization  of  all  stabilizing  compen¬ 
sators  for  the  given  plant.  Although  it  might  suffice  1 5  specify  a  single 
stabilizing  compensator  if  our  goel  was  simply  to  design  a  stable  system,  in 
practice  stabilization  is  only  the  first  step  of  the  design  process.  As  such,  we 
must  characterize  afl  stabilizing  compensators  If  w»  are  to  choose  among  the 
stabilizing  compensators  to  find  one  which  also  achieves  the  design  con¬ 
straints  and/or  optimizes  system  performance.  A  complete  parameteriza¬ 
tion  of  the  set  of  stabilizing  compensators  for  the  given  plant  is  thus  ob¬ 
tained  as  a  first  step  in  the  design  process.  Indeed,  the  parameterization  is 
chosen  in  such  a  way  that  the  various  feedback  system  gains  are  linear 
(affine)  in  the  resultant  design  parameter,  thereby  setting  the  stage  for  the 
choice  of  a  design  parameter  which  also  achieves  the  design  constraints 
and/or  optimists  some  measure  of  system  performance.  • 

Once  the  stabilizing  compensators  have  been  characterized,  step  two  of 
the  design  process  is  to  choose  a  subset  of  the  stabilizing  compensators 
which  also  achieve  the  prescribed  design  constraints;  tracking  and  distur¬ 
bance  rejection,  transfer  function  specification,  robustness,  etc.  Finally,  if 
any  remaining  design  latitude  exists  after  the  design  constraints  have  been 
met  it  may  be  used  to  optimize  some  measure  of  system  performance;  sen¬ 
sitivity,  energy  consumption,  etc. 

The  paper  is  divided  into  two  pans  dealing  with  the  classical  asymptotic 
design  probiemr.  stabilization,  tracking,  and  disturbance  rejection;  and  a 
survey  of  modem  frequency  domain  design;  robust  design,  pole  placement, 
simultaneous  design,  respectively.  In  the  remainder  of  this  introduction  the 
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fractional  representation  theory  for  a  single-variate  system  is  developed. 
The  key  to  this  theory  lies  with  the  representation  of  a  rational  function  as  a 
ratio  of  stable  rational  functions  rather  than  as  a  ratio  of  polynomials.  Such 
a  formulation  opens  the  door  to  the  desired  generalization,  wherein  stability 
is  well  defined  even  though  no  analog  of  a  polynomial  exists.  Moreover,  it 
yidds  what  we  believe  to  be  a  more  natural  concept  of  coprimeness  in  which 
only  cancellations  between  (closed)  right  half-plane  zeros  are  forbidden.  In¬ 
deed,  the  (strict)  left  half-plane  plays  only  a  minimal  role  in  the  theory. 

In  Section  2  a  derivation  of  the  YBJ  stabilization  theory  is  formulated  in 
terms  of  a  stable  coprime  fractional  representation.  Although  this  deriva¬ 
tion  has  appeared  before  [4,8],  even  in  the  single-variate  case,  a  complete 
proof  is  given  because  of  its  fundamental  nature  to  the  remainder  of  the 
work.  Indeed,  the  proof  technique  introduced  here  is  repeated,  in  one  form 
or  another,  throughout  the  paper. 

Sections  3-5  are  devoted  to  the  tracking  and  disturbance  rejection  pro¬ 
blems  [4].  Unlike  the  stabilization  problem  a  solution  to  these  problems 
may  fail  to  exist.  Necessary  and  sufficient  conditions  for  the  existence  of  a 
solution  are,  however,  given  in  the  form  of  appropriate  coprimeness  criteria 
and  a  complete  parameterization  of  the  required  set  ofxompensators  is  ob¬ 
tained  when  these  criteria  are  satisfied. 

Part  II  of  the  paper  begins  with  Section  6  in  which  the  problem  of  robust 
design  is  taken  up.  Unlike  the  stabilization  problem  for  which  every  solu¬ 
tion  is  robust  a  solution  to  the  tracking  and/or  disturbance  rejection  prob¬ 
lem  may  fail  to  be  robust.  Surprisingly,  however,  whenever  these  problems 
are  solvable  they  are  robustly  solvable.  As  such,  beginning  with  the  same 
coprimeness  criteria  used  in  the  non-robust  case  we  give  an  explicit 
parameterization  for  the  set  of  compensators  which  robustly  solve  the 
tracking  and  disturbance  rejection  problems.  This  result,  however,  only  ap¬ 
plies  to  our  single-variate  case.  In  the  general  multivariate  case  a  robust 
solution  may  fail  to  exist  even  though  a  non-robust  solution  exists  [11]. 

In  Section  7  the  problem  of  designing  a  compensator  which  simul¬ 
taneously  stabilizes  a  feedback  system  and  realizes  a  prescribed  input- 
output  feedback  system  gain  is  investigated.  The  required  existence  criteria 
for  this  transfer  function  design  problem  are  formulated  in  terms  of  a 
divisability  condition  in  the  ring  of  stable  transfer  functions.  This  is  followed  in 
Section  8  by  an  investigation  of  the  pole  placement  problem  in  which  one 
desires  to  construct  a  compensator  which  will  simultaneously  stabilize  a 
system  and  place  the  poles  of  its  input-output  gain  at  prescribed  points  in 
the  left  half-plane.  Interestingly,  the  extent  to  which  this  end  can  be  achieved  is 
determined  precisely  by  the  “degree"  to  which  the  plant  fails  to  be 
miniphase. 

In  Section  9  the  problem  of  designing  a  compensator  which  simul¬ 
taneously  stabilizes  two  distinct  plants  is  solved.  Although  this  rwo  plant 
problem  is  a  very  special  case  of  the  general  simultaneous  design  problem  it 
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is  the  one  example  of  the  problem  for  which  a  definitive  frequency  domain 
design  criterion  exists  [7]  and  is  thus  indicative  of  the  direction  of  future 
research  in  this  area.  Moreover,  the  problem  of  stabilizing  a  feedback 
system  with  a  stable  compensator  [25]  proves  to  be  a  special  case  of  this  two 
plant  problem  which  is  developed  in  Section  10. 

Section  11  is  devoted  to  a  short  discussion  of  the  1 optimization  problem * 
associated  with  step  three  of  our  design  process.  Since  the  specific  optimiza¬ 
tion  one  might  choose  to  undertake  is  dependent  on  the  physical  system 
under  study  and  its  application  the  development  in  this  section  concentrates 
on  the  interface  between  our  theory  and  the  optimization  process,  without 
going  into  specifics. 

Finally,  Section  12,  is  devoted  to  an  historical  overview  of  the  theory  and 
a  discussion  of  the  various  generalizations  end  extensions  which  thus  far 
have  been  formulated. 

Our  system  will  be  described  by  a  rational  function 


ris)  m 


Pis) 

Sis) 


(1.1) 


Such  a  system  is  said  to  be  stab/e  if  its  poles  lie  in  the  (strict)  left  half-plane. 
Since  the  point  at  infinity  is  taken  to  lie  on  the  imaginary  axis  this  implies 
that  ris)  is  stable  if  and  only  if  it  is  a  proper  rational  function  and  q{s)  is  a 
(strictly)  Hurwitz  polynomial. 

K  fractional  representation  for  r(s)  is  a  factorization  of  r(s)  in  the  form 

*,(*> 

d,is) 

where  both  nr{s)  and  dr{s)  are  stable  and  dr{s)  *  0.  If  one  is  given  a 
polynomial  fractional  representation  for  ris)  such  as  in  equation  1.1  then 
one  can  take 


ris ) 


(1.2) 


and 


*’>  ■ 

(1J) 

-  S8r 

(1.4) 

where  m(j)  is  any  Hurwitz  polynomial  such  that  the  order  of  mis)  equals 
the  order  of  r(s)  verifying  the  existence  of  the  required  fractional  represen¬ 
tation  [11]. 

We  say  that  the  fractional  representation  ris)  *  n,(s)/d,(s)  is  coprime 
if  there  exist  stable  rational  functions,  uf{s)  and  vf[s),  such  that 


u,(s)nf(s)  +  vris)drit)  ■  1 


(1.5) 
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Recall  that  for  polynomials  l.S  is  equivalent  to  the  requirement  that  nr{s) 
and  dr(s)  do  not  have  any  common  zeros  12] .  In  our  case,  however,  where 
we  are  dealing  with  stable  rational  functions,  Equation  1 .5  impks  that  n,(s) 
and  dr(s)  have  no  common  (closed)  right  half-plane  zeros  and  conversely 
(4,11].  Although  this  represents  a  departure  from  classical  control  theory 
only  right  half-plane  zeros  cause  instability  and,  as  such,  it  is  appropriate 
that  only  right  half-plane  pole  zero  cancellations  be  forbidden. 

Unlike  the  classical  polynomial  fractional  representation  theory  wherein 
the  units  are  the  constant  functions  in  our  theory  the  units  are  the  miniphase 
rational  functions  which  are  stable  and  admit  a  stable  inverse.  That  is,  if 
r(s)  *  p(s)/Q(s)  then  p(s)  and  q(s)  are  both  Hurwitz  polynomials  of  the 
same  order.  As  such,  the  classical  theorems  for  polynomial  fractional 
representations  may  be  reformulated- in  our  setting  with  the  units  taken  to 
•  be  miniphase  rational  functions  as  follows. 

1.  Property.  Let  r (s)  =  nr(s)  idr{s)  be  coprime  fractional  representation 
for  r(s)  and  assume  that  *,(*)  and  dt(s)  admit  a  common  divisor,  k(s). 
such  that 


and 


d,(s)  *  y,(a)*(s> 


ntls)  «jrfU)*(s) 

where  yr(s) ,  xr(s)  and  k{s)  are  stable.  Then  fc(jr)  is  miniphase.  That  is,  the 
only  common  divisor  of  a  coprime  fractional  representation  is  a  unit. 

Proof.  Since  d,(s)  »  yr{s)k{s)  and  nf(s)  =»  x,(s)k(s)  are  coprime  there 
exist  stable  ur(s)  and  vr(s)  such  that 

1  ■  u,(*)a,(s)  +  o ,(s)drM  ■  (ur(s)xr(ti  +  v,(s)yr(t)  J  k(s)  <1.4) 

shewing  that  [ur(s)xr(s)  +  vf(s)y,(s)  ]  is  a  suble  inverse  for  *(r)  and 
hence  verifying  that  k{s)  is  miniphase. 

2.  Property.  Let  r(s)  «  n(s)  fd(s)  be  a  fractional  representation  for  r(s) 
and  let  r(s)  *  xr(s)  /yr(s)  be  a  coprime  fractional  representation  for  r(f). 
Then  there  exists  a  suble  k(s)  such  that 


and 


nr(s)  »  jrrU)*(r) 


df{s)  *  yr{s)k{s) 

Proof.  Given  the  two  fractional  representations  let  k(s)  *  dr(s)/yr(s). 
Then  clearly  dr(s)  «  yr(s)k(s)  while 

n(S)  m*M.dris)  -  rWd(s)  -  dT(s\  -  Jr,<s)*U)  <1.7) 
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showing  that  k  (r)  is  a  common  factor  of  nf(s)  and  d,.(s) .  It  thus  remains  to 
verify  that  k (a)  is  stable.  To  this  end  recall  that  since  xr(s)  /y,(s)  is  coprime 
there  exists  stable  ur{s)  and  v,{s)  such  that 


hence 


(u,(s)x,(a)  +  vr{s)yris)  J  •  1 


(l.«) 


*(a) 


d,ls) 

/,(*> 


lu,ls)x,ls)  +  vrls)y,ts)  ]  -&|L 


ur(s)xr(s)dr(s) 

yr(s) 


+  »rls)d,ls) 


ur(s)r\s)dr(s)  +  v,(s)d,(s)  ( 1.9 ) 


urls)it,(s)df(s)  . 

</,(*> 


« »,(a)d,(a) 


«,(s)irr(s}  +  v,(a)4,(x) 


showing  that  k(s)  is  stable  since  we  have  expressed  it  as  a  sum  of  products 
of  stable  rational  functions. 

Note  that  since  a  coprime  fractional  representation  always  exists  [11]  for 
r(s)  Property  2  implies  that  any  pair  of  stable  rational  functions,  4r,(a)  and 
y,(s),  can  be  expressed  in  the  form  x,(s)  »  nr{s)k(j)  and  yr(s)  * 
dr(s)k(s )  where  it, (a)  and  d,{a)  are  coprime  stable  rational  functions  and 
k(s)  is  stable.  As  such,  k(s)  represents  a  greatest  common  divisor  for 
x,(s)  and  y,(s)  which  is  unique  up  to  a  miniphase  factor  via  Property  1. 


3.  Property.  Let  r(s)  m  n,  (a)  /d,(a)  be  a  coprime  fractional  represents* 

tion  for  r(a).  Then  r(s)  is  stable  if  and  only  if  dr{s)  is  miniphase. 

• 

Proof.  If  dr{s)  is  miniphase  then  l/d,(a)  is  suble  and  hence  r(s),  being 
the  produa  of  two  suble  functions  is  suble.  Conversely,  if  r(s)  is  suble 
then  we  may  express  n,(a)  and  dr(s)  in  the  form 

n,(s)  «  r(s)df(s)  (1.10) 

and 


d,is)  »  1  dr[s)  (1.11)  . 

showing  that  dr(s)  is  a  common  factor  of  the  coprime  rational  functions 
/», (a)  and  dr(s).  As  such  Property  1  implies  that  d,(a)  is  miniphase  as  was 
to  be  shown. 

4.  Example.  Consider  the  rational  function 


r(s)  » 


(J+0 
(s+Q* 
(a- 2)  * 

(J  +  2). 


n,(s) 

dr(s) 


(1.12) 
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Now,  nr(s)  has  zeros  at  s  =  -  1  and  s  -  oa  while  dr(s)  has  a  2ero  at  s  =  2. 
As  such,  /i,(*)  and  d,(s)  have  no  common  right  half-plane  zeros  thus  im¬ 
plying  the  existence  of  stable  rational  functions  ur(s)  and  vr(s)  such  that 
[«,(*)«,(*)  +  o,(s)d,(j)]  «  1.  Indeed, 


[f  ] 


r  i£±mi 

L  U  +  2)  JLu+2)J 


-  ur(s)nr(s)  +  vr(s)dr(s)  »  1 


(1.13) 


Note  that  unlike  the  case  of  a  polynomial  fractional  representation  the  ex¬ 
istence  of  common  (strict)  left  half-plane  zeros  does  not  preclude  coprime- 
ness.  Indeed,  an  alternative  coprime  fractional  representation  for  the  above 
rational  function  takes  the  form 

riitilin 

.  ft;-. '1-1  .  t  1  =  <!fL 

L  a+v  J  - 

where  n/Cs)  and  d,'(s)  have  a  common  zero  at  s  =  -  1.  There  are,  however, 
still  coprime  since 

r  rj£±!£i  +  r  (s+2/3)  i  r  (j-2)  (j-h)  i 

L  3CS+I)  J  L  (X+2P  J  l(r+l)JL  U+2)»  J 

(1.15) 

-  u'r(s)n'r(s)  +  v'ls)d;is)  »  1 
2.  Stabilization 

The  basic  feedback  system  with  which  we  deal  is  shown  in  Figure  1. 


mi  — 


|  Ml 


»'  +  *1 


Pis) 


Figure  1.  Basic  feedback  system. 


For  this  system  the  usual  algebraic  manipulations  [8]  will  yield  tht  feed¬ 
back  system  gains 


«2<*>J  k2(4,^)  Mj(s). 


(2.1) 


where 


ft.,  is)  a,  is r 

_  t 

a 

-Pis)  -J 

*1*1  ‘1*2 

l-rp(Slc(S) 

1+plslcU) 

A.-  <*> 

m 

c(r) 

1 

‘1*1  *1*2 

L  l+p(i)c(j) 

l+p(s)cU) 

M*n  _  r 


Viw  bws)' 


where 


.*„.<*>  * 


v,,].r  *•*<*>  *•«<'' ->i 


C(S)  -pis)e(i) 

1  +p(s)c(s)  1  +p(s)c(s) 

P(s)ejs)  pis) 

1  +p(s)ds)  l+pis)ds) 


Of  course,  the  system  is  said  to  be  stable  if  each  of  the  eight  feedback  system 
gains  of  equations  2.1  through  2.4  is  stable.  Since  the  input/output  gains, 
km.  are  expressed  in  terms  of  the  input/plant-input  gains,  via  equation 

2.4  this  will  be  this  case  if  and  only  if  the  iaput/piant -input  gains  are  all 
stable. 

For  our  stabilization  theory  we  assume  that  a  coprirae  fractional 
representation  for  the  plant  is  given  in  the  form 

n0is)  „  _ 

pls)  .  as) 

where  np{s)  and  dpis)  are  stable,  dp(s)  is  not  identically  zero,  and  there 
exists  stable  up(s )  and  vp(s)  such  that 

u/s)np(s)  +  opis)dp(s)  m  i  (2. S) 

In  our  single-variate  setting  every  plants  admits  such  a  representation  and 
hence  we  may  assume  2.5  and  2.6  without  loss  of  generality.  Our  goal  is  to 
characterize  the  set  of  compensators,  represented  in  the  form 


where  ne(s)  and  de(s)  are  stable  and  de{s)  is  not  identically  zero,  which 
stabilize  the  feedback  system.  Of  course,  we  would  also  like  2.7  to  be  a  co¬ 
prime  fractional  representation.  Indeed,  so  as  to  prevent  (right  half-plane) 
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I* 


pole-zero  cancellation  between  p(r)  and  c(s)  we  require  the  stronger  condi¬ 
tion  that 


Pls)c(s)  ■ 


np(s)ncls) 

dp(s)dels) 


be  a  coprime  fractional  representation. 

Substituting  the  fractional  representations  pis)  « 
e(s)  *  ne(s)  / de(s )  into  2.2  and  2.4  now  yields 


(2.8) 


npis)/dp(s)  and 


/w,)‘ 

dpls)dels) 

-np(r)df(j)  ] 

dp(s)dtls)  +  it p(s)  nels)  dp(s)  dc(s)  +  np(s)  ne(j) 

and 

dpls)ntls) 

dpls)dels) 

dpls)dtls)  +  npls)ntls)  dpls)dels)  +  np(s)ne{s)  J 

*«<*> 

dpls)nels) 

-np(s)nc{s) 

dpls)dels)  +  np{s)ne{s)  dpls)dcls)  +  itp1s)nc(s) 

V  *W> 

* 

npls)ne(s) 

npls)dc(s) 

[  dplx)dtlp) + n/s)nf(s)  dp{s)dels) + np{s)nc(s)  J 

'Since  the  fractional  representation  for  p(s)c(s)  given  in  2.8  is  coprime 
there  exist  stable  p(s)  and  q(s)  such  that 

Pis)  ldpls)dels)  1  +  qls)  lnpls)nc(s)  1  -  1  ail) 

hence 

lpls)-qls))ldpls)dels))  +  qls)  [dpls)dt{s)  +npls)nels) )  -  1  (2.12) 

showing  that  the  fractional  representation  for  A,,,,  Is)  given  in  2.9  is  co¬ 
prime.  As  such,  it  follows  from  Property  1  that  A,,,,  (s)  is  stable  if  and  only 
if  the  common  denominator,  [dp{s)dc{s)  +/i,(s)nf(J)]  is  miniphase. 
Moreover,  since  A.,,,  Is)  must  be  stable  for  the  feedback  system  to  be  stable 
it  follows  that  [dpls)df{s)  +  np(s)ne(s) )  must  be  miniphase  for  the  system 
to  be  stable.  Conversely,  if  this  common  denominator  is  miniphase  the 
system  is  clearly  stable  via  2.9  and  2.10.  Moreover,  if  the  common 
denominator  is  miniphase 

(d(s)de(s)  +  n  (S)nels) )  -»  [d As )dt Is)  ] 

(2.13) 

+  [dpls)dels)  +fi,(x)*c(s)]',(<i,(x)nc(j)]  +  1 

showing  that  the  corresponding  fractional  representation  for  p(s)c{s)  is 
coprime.  We  have  therefore  proven  the  following. 


'  \*  V  V  V  o  i"  .  ■;  »■ 


s  ■-  v  •  -  • 
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S.  Property.  Let  p(s)  *  np(s)/dp(s)  be  a  coprime  fractional 
representation  for  p{s)  and  let  c(s)  ■  ne(s)/dc(s)  be  a  fractional 
representation  for  c(s).  Then  the  feedback  system  is  stable  and  p(s)cls) 
*  [n,(s)ne{s))/[dp{s)de(s)\  is  coprime  if  and  only  if 
[dp(s)dt{s)  +  i*,(a)nf(s)  ]  is  miniphase. 

Consistent  with  Property  S  the  goal  of  die  feedback  system  stabilization 
problem  is  to  characterize  the  compensators,  c(s)  ■  nc(s)  / de(s ) ,  such  that 
[dp(s)dc(s)  +npls)ne(s)]  is  miniphase  given  the  coprime  fractional 
representation 


where 

yfis)np{s)  +  op{t)dp  m  i  (2.15) 

for  some  stable  up(s)  and  vp(s). 

Stabilization  Theorem:  For  the  feedback  system  of  Figure  1  let  the 
plant  have  a  coprime  fractional  representation  as  per  equation  2. 14  and 
2.15.  Then  for  any  stable  w(x)  such  that  w(x)«f,(x)  +  vp{s)  is  not  iden¬ 
tically  zero  the  compensator 

( -  w(x)4L(x)  +  a  (a)  ]  «c(x) 

(w(x)*,(x)  +e,(s)J  dc[s) 

stabilizes  the  feedback  system  and  yields  a  coprimc  fractional  representa¬ 
tion  on  p(s)c(s)  m  [np (x) ne ls)}/[dp(s)dt(s)}.  Conversely,  every 
such  stabilizing  compensator  is  of  this  form  for  some  stable  w(x) . 

Proof.  According  to  Property  S  it  suffices  to  characterize  the  dass  of 
suble  ne(s)  and  de(s)  such  that 

d/s)d'W  +  np{s)ne(s)  -  k{s)  0-14) 

where  k[s)  is  an  arbitrary  miniphase  function.  To  this  end  we  wiD  attempt 
to  compute  all  possible  stable  solutions  to  equation  2.16.  Multiplying  equa¬ 
tion  2.15  through  by  k(s)  yields 

(*<x)ir,<x)]«,(s)  +  l k{s)vpW)dpis)  -  *U)  ai7) 

verifying  that 

af(r)  -  k(s)up(s)  (2.1») 

and 

df(s)  m  k(s)vp(s)  0.19) 

are  panicular  solution!  to  equation  2.16.  On  the  other  hand 


% 
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dp(s)  [n p{s)r{s) )  +  np(s)  { -  dp(s)r(s)  ]  -  0  (2.20) 

for  all  ruble  r(s)  showing  that 

«*(r)  ■  —dp(s)r{s)  (2.21) 

and 

dj(r)  •  »p(s)r(*)  002) 

are  homogeneous  solutions  to  2.16  for  all  suble  r(s).  It  remains  to  show 
that  2.21  and  2.22  represent  all  homogeneous  solutions.  To  this  end  ier 
n*(s)  and  d*(s)  represent  arbitrary  suble  homogeneous  solutions  to  2.16. 
That  is 

df(s)£(s)  +  np{s)n*(s)  -  0  0.23) 

in  which  case  we  will  show  that  they  take  the  form  of  2.21  and  2.22.  As  a  can¬ 
didate  for  r(s)  let  us  take  r(s)  -  -n*(s)  /  d#(s)  in  which  case  we  have 


-  -dp(s)r{s) 


verifying  2.21  and 


,  -  *m(s)a*(s) 

£<*)  - - jjf) - *  ap(s)r(s)  0.25) 

verifying  202.  It  thus  remains  to  show  that  r(j)  »  -  n*(s)  /dp(s)  is  suble 
for  which  we  have 

**U)  «J(s) 

'•(')  -  -  Y(s)  "  ~  TJsT 

**(s)a  <*)*  (t)  d(s)d*(s)u(s) 

-  -  /,./ - £<»•’/'>  -  r  -i  •  -«*(.)»,(') 


*  d*(s)up(s)  -  n*ls)vpls)  .  0-26) 

showing  that  r(s)  is  suble  since  it  is  expressed  as  the  sum  of  products  of 
suble  rational  functions.  As  such,  the  entire  solution  space  for  equation 
2.16  takes  the  form 

*»c(r)  •  i»*(s)  +«£(J)  ■  -r(s)dp(s)  +*(r)e^(x)  (2.27) 


df(r)  •  d*(s)+df(j)  ■  r(r)«p(r)  +  *(s)i>,(r) 
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where  k(s)  is  an  arbitrary  miniphase  function  and  r(s)  is  an  arbitrary 
stable  function. 

Now,  assuming  that  r(x)  and  k(s)  are  chosen  so  that  de(s)  is  not  idea* 
ticaily  zero  we  obtain  the  desired  set  of  compensators  in  the  form 

[-r(t)dAs)  +  k(t)uAs)  ] 


30* 


[r(s)np(t)  +k(s)v,(t)  J 

[  l -r<x) /kit)  )dAt)  ♦  «.(/)]  [-w(s>d  (x) +*  (i)] 

m  .1  i  ■  ■  i  .  1 1  r  ■■  £■■■■.. .  m  r  ■  r  (Jjf) 

[  [r(x)  /k(s)  l*#(x)  +  e#(x)  ]  [w<x)«#(s> + 

where  w (x)  •  r(s)/k(s)  spans  the  set  of  stable  rational  functions  such  that 
lMs)n  As)  +  o,(x)  ]  is  not  identicdly  zero. 

In  audition  to  giving  a  complete  parameter! ration  of  the  stabilizing  com* 
pensators  if  one  views  w(x)  rather  than  c(x)  as  the  underlying  design  para* 
meter  for  our  feedback  system  the  expressions  for  the  various  feedback 
system  gains  are  greatly  simplified.  This  follows  by  observing  that  the  com* 
pensator  of  the  theorem  yields  the  common  denominator 

[4tix)dt(t)  +  e,(x)»,(x)]  - 1  _  (2Jt) 

(since  we  have  divided  k(s)  out  of  the  expression  for  c(x) ).  As  such,  the 
denominators  in  Equations  2.9  and  2.10  drop  out  yielding  the  following  ex¬ 
pressions  for  the  feedback  system  gains  which  are  hnear  (actually  affine)  in 
the  design  parameter  w(x). 

6.  Corollary.  The  feedback  system  gains  which  result  from  the  use  of  the 
compensator  of  the  stabilization  theorem  t»if*  the  form 

LAj^U)  A^jUlJ  L  <sMj<xM-.,lxWr,fo  WU)*0MPU)+»,U)4,{I)  J 

and 

ra^u)  ^  r  -  »<i)x* u>  +  *,u>rf,u) 

Lv,u)  fcyjUJJ  "  J 

Proof.  These  relationships  result  Immediately  upon  substituting  2.30  and 
the  expressions  for  nc(x)  and  dt{s)  of  the  theorem  into  equations  2.9  and 
2.10. 

The  theorem  gives  a  complete  parameterization  of  the  stabilizing 
compensators  for  our  feedback  system  modulo  the  requirement  that 
"(s)np(s)  +v,(x)  not  be  identically  zero.  Needless  to  say,  in  our  single¬ 
variate  case  this  requirement  is  trivially  verified.  Moreover, 
**(x)n#(x)  +  v As)  is  always  non-zero  for  some  w(x)  hence  the  existence  of 
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a  stabilizing  compensator  for  any  single-variate  plant  is  guaranteed. 

7.  Corollary.  Every  single-variate  plant  admits  a  stabilizing  compensator. 
Proof:  Consistent  with  the  theorem  it  suffices  to  verify  the  existence  of  a 
stable  w(s)  such  that  w(r)«p(j)  +up(r)  is  not  identically  zero.  Indeed, 
either  w(s)  ■  -  1  or  w(r)  ■  0  suffices.  If  w(s)  •  - 1  fails,  i.e., 

de{s)  -  -np(s)  +  v/s)  m  0  (2.31) 

then  the  coprimes  ess  equality 


1  ■  up(s)np(s)  +  vp(s)dp{s)  ■  { up{r)+dp(r)]»p(r)  (2-32) 

implies  that  vp(r)  is  miniphase,  since  (up(s)  +  dp(s)  ]  is  a  stable  inverse  for 
pp(s) ,  in  which  case 

dc(s)  •  [0]np)+vp(s)  •  vp{s)  (2.33) 


is  not  identically  zero. 

Note  that  the  above  result  is  contingent  on  the  existence  of  a  coprime  frac- 
•  tional  representation  for  the  plant  and  therefore  may  fail  in  the  various 

generalized  settings  to  which  the  theory  can  be  extended  [8].  It  does, 
however,  hold  in  the  multivariate  case  wherein  a  coprime  fractional  repre¬ 
sentation  is  also  assured  to  exist  [11].  - 

Occasionally,  one  desires  to  design  a  compensator  which  is  a  proper  ra¬ 
tional  function;  i.e.,  c(«)  <  e»;  rather  than  simply  asking  for  a  stabilizing 


compensator  [2,  11  ].  Now,  c(s)  m  *ie(j)/de(s)  is  coprime  via  Equation 
2.30  hence  nc(e»)  and  dc(oe)  are  not  both  simultaneously  zero.  On  the 
•.other  hand  ne(»)  <  oo  since  ne(s)  is  stable  hence 

c(o»)  -  "'*m)  <  »  (2JO 

dc(») 

if  and  only  if  rfc( os)  m  w(oa )n^(a»)  +up(od)  ft  o.  Of  course,  in  this  case 
»(s)»p(s)  + 1 >p(s)  is  not  identically  zero  showing  that  the  proper  stabiliz¬ 
ing  compensators  take  the  form 


c(r) 

where  w(s)  is  stable  and 


-w{r)dp(r)  +up(s) 
»i*)np(s)  +vp(s) 


<2J5) 


*(•)«,(»)  +Pp(»)  *  0  (2J«) 

We  may  now  consider  two  cases.  First,  if  the  plant  is  strictly  proper, 
p(  os)  *  0,  then  np(co)  *  0  and  since  n  (s)  andup(r)  are  coprime  via  2.13 
implies  that  vp(»)  *  0.  As  such,  2.36  is  satisfied  for  all  stable  w(r).  On 
the  other  hand,  if  the  plant  is  not  strictly  proper,  p ( » )  M  0,  then  np(oo)  * 
0  in  which  case  2.36  reduces  to  w(«)  *  -  vp{o»)/np(a»).  We  have  thus 
verified  the  following  corollaries  [10,  11  ]. 
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S.  Corollar;.  If  p(s)  is  strictly  proper  then  the  set  of  compensators  given 
by  the  theorem  are  all  proper  and  well  defined  for  every  stable  w(s). 

9.  Corollary.  If  p{s)  is  not  strictly  proper  then  the  compensators  given  by 
the  theorem  are  well  defined  and  proper  if  and  only  if 


w(m)  *  ' 

Finally,  rather  than  simply  looking  for  a  proper  compensator  we  may 
desire  to  design  a  stable  compensator  [23].  Although  such  a  compensator 
does  not,  in  general,  exist,  a  criterion  for  the  existence  of  a  stable  stabilizing 
compensator  and  an  algorithm  for  its  construction  is  given  in  Section  10  as  a 
corollary  to  the  simultaneous  stabilization  theorem.  The  result  is,  however, 
far  from  elementary  and  no  parameterization  of  the  space  of  such  compen¬ 
sators  is  known  [17]. 


10.  Example.  For  the  plant  of  Example  4  with  the  coprime  fractional 
representation  of  of  Equations  1.12  and  1.13  the  required  set  of  stabilizing 
compensators  take  the  form 


with 


037) 


031) 


verifying  that  the  resultant  compensator  is,  indeed,  proper  given  a  strictly 
proper  plant. 

Now,  let  us  repeat  the  above  example  using  the  alternative  coprime  frac¬ 
tional  representation  of  Equation  1.14  and  1.15  which  yields 
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As  such,  the  seme  set  of  compensators  are  obtained  from  the  alternative 
coprime  fractional  representation  as  from  the  original  representation 
though  the  parameterization!  defined  differ  by  the  miniphase  factor 

Uf+iw/Uf+tf). 

Finally,  let  us  set  w'  (a)  *  0  in  2.39  obtaining  the  compensator 


clt) 


l«U+2) 

w*m 


0.41) 


Now.  c(s)  has  a  zero  at  s  »  -2  which  cancels  the  pole  of  p(s)  at  s  ■  >2. 
This  does  not,  however,  contradict  the  requirement  that  p(a)c(s)  ■ 
In ,(s)nt(s))/[d0{s)dt(s))  be  coprime  since  our  coprimcness  concept 
only  forbids  right  half-plane  pole-zero  cancellations.  Of  course,  a  left  half- 
.  plane  pole-zero  cancellation  such  as  encountered  in  the  above  example  is 
benign  and  need  not  be  forbidden. 


3.  Tracking 

Once  we  have  stabilized  our  feedback  system  we  may  use  the  remaining 
design  latitude,  the  choice  of  a  stable  w(j),  to  meet  various  system  design 
constraints.  The  first  of  several  such  design  constraints  which  we  will  con¬ 
sider  are  the  asymptotic  tracking  and  disturbance  rejection  conditions 
wherein  we  require  that  the  system  asymptotically  follow  or  reject  a 
prescribed  input  [4]. 

In  the  tracking  (or  asymptotic  regulator )  problem  it  is  desired  to  design  a 
stable  feedback  system  whose  output,  v2,  asymptotically  follows  a  pre¬ 
scribed  input  which  we  model  by  the  impulse  response  of  a  transfer  function 
((*),  as  illustrated  in  Figure  2.  As  usual  we  assume  that  t(s)  admits  a 


Figure  2.  Feedback  system  with  tracking  generator. 
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(3.1) 


coprime  fractional  representation  in  the  form 


where  there  exist  stable  u,(r)  and  u,(s )  such  that 

it;(j)ff,(s)  +  of(r)rf,(x)  -  1  (3.2) 

We  say  that  the  system  tracks  the  impulse  response  of  tit)  if 

tit)  -  Ay,  is)  tit)  -  A»J#J  (x)r(x)  OJ) 

is  sable.  Recall  that  the  impulse  response  of  a  .single- variate  system  is 
asymptotic  to  zero  if  and  only  if  the  corresponding  transfer  function  is 
stable.  Thus  the  response  of  our  system  to  the  impulse  response  of  t(s)  will 
be  asymptotic  to  the  impulse  response  of  t(s)  if  and  only  if  the  transfer  of 
equation  3.2  is  stable. 

Recall  from  Corollary  6  that  “ 

Ay,(s)  ■  »is)s0(s)d0is)+»p(s)dp(s)  0.4) 

hence  if  we  desire  to  stabilize  the  system  and  simultaneously  cause  it  to  track 
the  impulse  response  of  t(s)  we  must  choose  a  stable  w(s)  such  that 
w(s)ttp(s)  +  v/(s)  is  not  identically  zero  and 

,  .  !»<*)« u(s)d-(s)+e  (s)d  (x)K(s) 

At,*,  (S)t(s)  m  -  —  : -  OS) 

is  sable. 

11.  Property.  Given  pis)  there  exists  a  compensator  for  the  feedback 
.system  of  Figure  2  which  stabilizes  the  system  and  simultaneously  causes  It 
to  trade  the  impuslse  response  of  t{s )  if  and  only  if  the  equation 

w(i)»#(r)rf,ti)+x(i)rf,(r)  m  «,{x)«,(x)~ 1 

admits  stable  solutions  w (s)  and  x(s)  such  that  w(x)ff,(x)  +  upis)  is  not 
identically  zero.  In  this  case  the  required  compensator  takes  the  form 

(-w(x)sr,(f) +»,(«)) 

C(s)  ■  »  '  ■  ■  r  ■ ■  r  mi 

[»is)nftls)+spis)] 

where  w(s)  is  a  solution  to  the  above  equation. 

Proof.  If  there  exists  a  stable  w(s)  such  that  3.5  is  suble  then  it  follows 
from  3.2  that 

[»is)n0is)d0{s)  +  u0is)d0(s)  J 
d,is) 
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[w(s)np(s)dp(s)  +  vp(s)dp(s))  (u,(*)«i,(s)  +  v, (s)d,(s)  1 
d,(s) 


0.« 


f  I  +  vAs)dAs)  ]n,{s)  1 

»  - - - -  ■  -* - - - J  «,(/)  +  («*(x)n,(x)rf/U)  +  vp(s)dp(t)  !*,(!) 

-  -*U> 

is  stable  since  it  is  expressed  as  the  sum  of  products  of  stable  functions.  Re¬ 
arranging  3.6  and  invoicing  2.15  then  yields 

Ms)np(s)dp(s)  +x{t)d,{s)  -  -  trp{s)dpis)  ■  up(s)np{s)  - 1  (3.7) 

as  required.  Conversely,  if  3.7  admits  stable  solutions,  w(a)  and  jr(a), 
where  w(s)np{s)  +  vp{s)  is  not  identically  zero  we  define  e(s)  by 


c(s) 


I  -  »(*)dpls)  +*pls)  J 


Iw(j)i»#(j)  +  »j(x)1 
using  the  w(s)  of  3.7.  Now,  with  this  w(j)  3.7  and  3.S  imply  that 
.  .  _  t"(x)  *f,W  +  »0Wdp{s)]*,ls) 

^j(s)rls)  - 

•-  l J !»,(!) 


0.t) 


C3.f) 


d,(s) 


-X(l)«l,(S) 


is  stable.  Since  the  stabilization  theorem  implies  that  any  c{s)  in  the  form  of 
3.8  stabilizes  the  system  while  3.9  implies  that-A,|^I(x)t(j)  is  stable  for  this 
choice  of  w(x)  we  have  constructed  the  desired  compensator. 

Tracking  Theorem:  Given  p{s)  there  exists  a  compensator  for  the  feed¬ 
back  system  of  Figure  2  which  stabilizes  the  system  and  simultaneously 
causes  it  to  track  the  impulse  response  of  f(s)  if  and  only  if  n As)  and 
d,  (s)  are  coprime.  In  this  case  let  upl(s)  and  v^(s)  be  stable  functions 
such  that 

+  »pl(s)d,(s)  -  1 

and  let  a(s)  =*  nt(.s)/dt(s)  be  a  coprime  fractional  representation  of 
a(j)  »  dp(s)  /d,(s).  Then  the  desired  set  of  compensators  take  the  form 


where 


\-y»[s)dp{s)  +up(s)J 
ClS)  “  Ms)npis)  +  vp(s)} 


w(s)  •  -upl(s)vpls)-¥e{s)dt(s) 
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with  e(s)  an  arbitrary  stable  function  such  that  w(s)/i  (s)  +  t^(s)  is  not 
identically  zero. 

Proof.  Consistent  with  Property  11,  it  suffices  to  show  that  the  equation 

»(s)xp(s)dp(s)  +x(s)dr(s)  rn  up(s)np(s)  - 1  0.10) 

admits  stable  solutions  w(s)  and  xU)  such  that  w(s)np(s)  +  v As)  is  not 
identically  zero  if  and  only  if  np(s)  and  d/s)  are  coprime.  In  this  case  we 
then  show  that  the  appropriate  w  (s)  takes  the  form 

W(I)  rn  -Um(s)vp{s)  +t(S)d'{S)  ail) 

for  any  stable  els). 

If  3.10  admits  stable  solutions  w(s)  and  x(s)  then  it  follows  from  3.10 
that 


",(s)np(s)  -  »(s)ttp(s)dp(s)  -x{s)d/s)  •  t  0.12) 

or  equivalently 

|a,|f)  -  "Uid/s)  ]np{s)  ♦  [-x(r)  )d,(s)  -  I  0.13) 

showing  that  n/s)  and  d/s)  are  coprimc.  Conversely,  if  np{s)  and  d/s)  are 
coprime  there  exists  u0(s)  and  u0(s)  such  that 

if^(j)»#(r)  +vp,ls)d,(s)  -  i  _  0.14) 


from  which  it  follows  that 
"p(s)"p(s)-l  »  - *p(s)dp(s ) 

■  ”  1*0,1*)"/*)  + *mis)d/s)vp(s)  ]v/s)d/s)  0-15) 


As  such. 


and 


■  \-  yp,{s)u/s))n/s)d/s)  +  i-om(s)9/s)dp{s)  ]d,(x) 

w4{s)  m  -upt{s)vp{,x)  0.14) 


xd{s)  ■  -  vm{s)v/s)d/s)  0.17) 

represent  particular  solutions  to  3.10. 

To  construct  homogeneous  solutions  3.10  we  define  a  transfer  function 
a(s)  by  a(s)  •  d/s) /d/s )  and  let 


*<r) 


0.lt) 


be  a  coprime  fractional  representation  for  e(s).  It  then  follows  from  Pro¬ 
perty  2  that  there  exists  a  stable  k(s)  such  that 
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dp{s)  -  ntls)k(s)  (3.19) 

and 

d,(s)  -  (3.20) 

Thus  if  we  define  candidates  for  the  homogeneous  soiutionion  of  3.10  by 

H^(f)  -  e(s)dt(s)  (3.21) 

and 

x*{s)  -  -e(s)np(s)rtt(s)  (3.22) 

where  e(s)  is  an  arbitrary  stable  rational  function  we  have 
^(s)np(s)dp(s)  +r*(r)d,(r) 

(3.23) 

-  e(s)d'(s)np(s)nt{s)F(s) -e(s)np{s)ntls)dals)k{s)  •  0 

verifying  that  3.21  and  3.22  are,  indeed,  homogeneous  solutions. 

To  complete  the  solution  of  3.10  we  must  show  that  all  homogeneous 
solutions  are  of  the  form  3.21  and  3.22.  To  this  end  assume  that  w*(s)  and 
x*(s)  are  stable  and  satisfy 

]?U)"pis)dpls)  +2*ls)dt(s)  m  0  -  (3.24) 

and  define  e(s)  by 

*(s)  -  »?(i)d'(s)  (3.25) 

Qearly, 

w*(r)  -  «(s)d,(»)  (3.26) 

while  it  follows  from  3.24  that 

-w*ls)n As)d _<s) 

**U>  -  - — 2 —  -  -w*(s)/y»(r) 


-J»*(<)ff,(r)«,(r) 


- e(s)np{s)n,(s ) 


showing  that  w*(r)  and  x*(s)  have  the  form  of  3.21  and  3.22.  It  remains, 
however,  to  verify  that  e(r)  is  stable  for  which  purpose  we  invoke  equations 
3.14  and  the  coprimeness  of  nt(s)  and  de(s)  from  which  it  follows  that 
there  exists  stable  ua(s)  and  va(s)  such  that 

ut(s)nt(s)+ut(s)dt(s)  -  1  (3 .23) 
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Ax  such. 


*U)  » 


~d^s) 


w*(s) 


(«#U)«,(r)  +vtls)dt{:)\ 


wf,{S)ut(S)Ha(S) 

5*) 


*  w*(s)v,(s) 


.  w*(X)ll#{s)rf  (l)  . 

W*<S)Ktf(X)«<X)  +  W*(X)V,tf)  -  ■= - y'-fl-g -  +  w*(x)u,(x) 


[w*(x)v.(x)rf_(x)l 

- g—p2 — I  (*,,(x)/i,(s)  +**(*)*, U>]  +  f*W»,U) 


0J») 


w*(x)v#(x)rf#(x)Mj,(x)A,(x) 

2*jf 


+  W*(X)uc(X)rf,(X)»„(X)  +  w*(X)w#(X) 


■  -x*(x)i»#(x)«^(x)  +^U)u§(s)dp(s)vm(s)  +  w*[s)otW 

is  stable  since  we  have  expressed  it  as  the  sum  of  products  of  stable  func¬ 
tions.  Note,  the  last  equality  in  3.29  follows  from  3.24. 

The  solution  space  for  Equation  3.10  thus  takes  the  form 

w(x)  •  -«^(s)»#(xH»«U')d#(s)  (3 JO) 

and 

JT(X)  «  -»,,(/) V,(X)rf,(X)-«(X)A,(X)<l4(X)  (3J1) 

As  such.  Property  11  implies  that  if  the  w(x)  of  Equation  3.30  is  used  to 
define  a  stabilizing  compensator  as  per  the  stabilization  theorem  it  will  also 
cause  the  system  to  track  the  impulse  response  of  T(s) .  Of  course,  we  must 
also  assume  that  e(x)  is  chosen  so  that  w(s)n,(x)  +  vAs)  is  not  identically 
zero.  To  complete  our  proof  that  the  coprimeness  o(np(s)  and  d,(s)  is  a 
sufficient  condition  for  the  solution  of  the  tracking  problem  it  thus  suffices 
to  show  that  there  exists  at  least  one  choice  of  e(x)  such  that 
"(s)np(s)  +  vp(s)  is  not  identically  zero.  From  3.30  it  follows  that 

w{s)np(s)  *  vp{s)  ■  [  -  upt{t)np(s)  + 1  ] vp{s)  +  t(s)dp(s)np(s)  C3 J2)' 

Now,  dt(s)  is  not  identically  zero  since  it  is  the  denominator  of  a(s).  As 
such,  if  np(s)  is  not  identically  zero  it  follows  from  3.32  that  the  value  of 
w(s)np(s)  +  vp(s)  is  a  non-trivial  function  of  e(s)  and  is  therefore  not 
identical  to  zero  for  all  e(x).  On  the  other  hand  if  np(s)  ■  0  then  2.15  im¬ 
plies  that  up(s)  is  miniphase  which,  in  turn,  implies  that  w(s)np(s)  +  vp(s) 
*  vp(s)  is  not  identically  zero.  Our  proof  is  therefore  complete. 

Although  the  proof  of  our  theorem  is  long,  though  elementary,  the  basic 
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result  to  the  effect  that  a  compensator  exists  which  will  simultaneously 
stabilize  the  system  and  cause  it  to  track  the  impulse  of  r(s)  if  and  only  if 
np{s)  and  d,(s)  are  coprime  is  simple  to  check  (no  common  right  half-plane 
zeros).  Moreover,  the  construction  of  the  required  compensator  is  simply  a 
matter  of  substitution  as  per  the  following  example. 

12.  Example.  Consider  the  problem  of  designing  a  compensator  for  the 
plant  of  Examples  4  and  10  so  that  the  system  is  stable  and  asymptotically 
tracks  a  step  function.  Recall  that 


’  (S+l)‘ 

(s  +  2/3) 

(s— 2) 

.(s  +  2)* 

(s+2) 

(S  +  2) 

While  we  take  f(s)  =  1/s  from  which  we  obtain 


/(j)  »  — 


[_!_] 

1  L<*+2)J 

s  “  ,  "  d,l 

.<s+  2) 


Now,  np(s)  has  no  right  half-plane  zeros  while  d,(s)  has  a  right  half-plane 
zero  at  s  «  0.  As  such,  np(s)  and  dt(s)  are  coprime  and  according  to  the 
theorem  the  desired  compensator  exists.  Indeed, 

[4]  [£$  *  [uTsl  fcj*]  '  -  -  »•*> 

As  a  final  step  in  the  construction  of  our  compensator  we  let 


[—1 

dpW  _  [(s  +  2) j  _  n/s) 
<*,(s)  s 

.(s+2) 


which  is  coprime  since 

-«1  ^1  +  [2]  [—^—1  «  "a(s)nAs)+vAs)dAs)  (3.3 
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It  thus  follows  from  the  theorem  that  the  desired  w(j)  takes  the  form 


*(*)  -  +  e{s)da(s) 


-40  +  2/3)' 

*  1 

(x+2) 

** 

e(s) 


(3.39) 


where  e(s)  is  an  arbitrary  stable  function.  Substitution  of  any  such 
w(x)  mto 


e(s)  •  l~wls)dp(s)+utWl 
lw(x)n,(x)  +  op{s) ) 


0.40) 


thus  defines  the  required  compensator  so  long  as  e(s)  is  chosen  so  that 
w(s)/tp(s)  +  vp(s)  is  not  identically  zero.  To  verify  our  solution  we  may 
substitute  w(x)  into  the  formula  for  A,,*,  (x)  of  Corollary  6  obtaining 

A.,*,  (x)  •  I — £— 

Since  A,|#|  (x)  has  a  zero  at  x  *  0  the  required  tracking  property  may  now  be 
'  verified  by  the  final  value  theorem. 

Now  let  us  consider  an  alternative  problem  where  we  are  required  to 
track  e^U  (/).  Here  our  tracking  generator  is  defined  by 


(x(x— 2)(r+2/3)  +  (f+!)(s-Z)e(s)]  0.41) 


As  before  n#(x)  and  dt(s)  are  coprime  since 


m  ijiiiij + r  (£+2£)i  r<x-2)i 

L3jL(x+2)jJ  L  (x+2)  J  L(x+2)J 


+Vplls)dl(s) 


Finally,  for  this  example  we  have 


where 


«(x) 


Co-2)' 

m  (t+ 2).  _  l 
d,(x)  *  (x  -  2)  I 

.(x  +  2). 


«,(x) 

~dJ7) 


0.42) 


1  0.43) 


0.44) 


[onn  +  mm  m  "tis)"ais)+oa(s)da(s)  •  1  o.ss) 

Note  that  in  this  example  dp{s)  and  d(  (x)  are  not  coprime  since  they  have  a 
common  right  half-plane  zero  at  x  «  2.  For  our  purposes,  however,  all  that 
is  required  is  a  coprime  fractional  representation  for  a(s)  =*  dp(s)  / d,(s )  as 
constructed  above.  Using  these  new  values  for  upi(s)  and  da(s)  we  obtain 
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16(j  +  2/3)' 

-<sl  *  -  bu«rj +  ,<s’ 


for  any  stable  e(s).  This,  in  turn,  yields 


(*-2)  ]  ,  , 

,U>  ■  9(772?  U»»J- 6^-201 -8)  +  9(5+lHJ  +  2)e(s)l  (3.47) 


for  which  the  zero  at  s  *  2  indicates  tracking.  Note  that  every  stable  w(s )  is 
obtained  for  some  e(s)  and  hence  all  stabilizing  compensators  track 
e^UO)  in  this  example.  -  * 


4.  Disturbance  rejection 

There  are  two  alternative  disturbance  rejection  problems  which  arise 
naturally  in  our  feedback  system  theory.  Figure  3a  indicates  the  configura¬ 
tion  for  the  input  disturbance  rejection  problem  {16]  wherein  we  desire  to 
design  a  compensator  which  simultaneously  stabilizes  the  system  and  causes 
it  to  asymptotically  reject  the  impulse  response  of  r(s).  i.e.,  the  response  of 
the  system  to  the  impulse  response  of  r(s)  should  be  asymptotic  to  zero. 


Figure  3.  Feedback  system  configuration  for  a)  the  input  disturbance  rejection  prob- 
blem,  b)  the  output  disturbance  rejection  problem,  and  c)  a  modified  configuration 
for  the  output  disturbance  rejection  problem. 
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A  similar  output  disturbance  rejection  problem  [4]  is  illustrated  in 
Figure  3b.  Here,  the  distrubance  is  injected  into  the  system  at  the  plant  out* 
put  and,  as  before,  it  is  desired  to  design  a  compensator  which  simul¬ 
taneously  stabilizes  the  system  and  causes  it  to  asymptotically  reject  the  im¬ 
pulse  response  of  r(s) .  Surprisingly,  however,  the  output  disturbance  rejec¬ 
tion  problem  is  completely  equivalent  to  the  tracking  problem  considred  in 
the  previous  section.  To  see  this  simply  observe  that  the  block  diagram  of 
Figure  3c  is  equivalent  to  that  of  Figure  3b.  As  such,  if  we  design  a  compen¬ 
sator  to  stabilize  the  system  and  cause  the  planr  output  to  asymptotically 
track  the  impulse  response  of  -r(s)  when  the  impulse  response  of  r(s)  is 
added  to  the  plant  output  the  total  effect  of  the  disturbance  observed  at 
will  be  asymptotic  to  zero.  Consistent  with  the  above  we  give  no  further 
consideration  to  the  output  disturbance  rejection  problem  since  it  may  be 
resolved  via  the  techniques  of  the  previous  section  with  t(s)  ■  -  r(s ) .  In¬ 
deed,  one  can  solve  the  tracking  and  output  rejection  problem 
simultaneously  by  working  with  tracking  generator  t(s)  —  r(r). 

For  the  input  disturbance  rejection  problem  we  require  that  the  impulse 
response  of 

AV|  (s)r(s)  m  [  -  w{s)n0(s)dpis)  +  up(s)np(t)  ]/■(*)  (4.1) 

be  asymptotic  to  zero.  Hence  to  simultaneously  stabilize  the  feedback 
system  and  cause  it  to  asymptotically  reject  the  impulse  response  of  r{s)  we 
must  choose  a  stable  w(x)  such  that  w {s)np{s)  +u,(s)  is  not  identically 
zero  and  (s)r(s)  is  stable.  The  required  theory  [10]  is  essentially  iden¬ 
tical  to  that  used  to  solve  the  tracking  problem  and  hence  we  simply  state 
'*■  the  pertinent  theorems  without  proof.  For  this  we  let  r(x)  «  nf(s)  /dr{s)  be 
a  coprime  fractional  representation  for  .'(r). 

13.  Property.  Given  p{s)  there  exists  a  compensator  for  the  feedback 
system  of  Figure  3a  which  stabilizes  the  system  and  simultaneously  causes  it 
to  reject  the  impulse  response  of  r(s)  if  and  only  if  the  equation 

w(x)n#(r)d#(r)  +  y{s)d,{s)  ■  up{s)np{s) 

admits  stable  solutions  w(x)  and  y{s)  such  that  w(s)np(s)  +  vp (s)  is  not 
identically  zero.  In  this  case  the  required  compensator  takes  the  form 

t-w(s)d 0<s)+y,  (s)J 

C(s)  m  . — g - C-  

l»Cr)«,C*  >+%(*)] 

where  w(s)  is  a  solution  to  the  above  equation. 

Disturbance  Rejection  Theorem.  Given  p(s)  there  exists  a  compen¬ 
sator  for  the  feedback  system  of  Figure  3a  which  stabilizes  the  system 
and  simultaneously  causes  it  to  reject  the  impulse  response  of  r(r)  if  and 
only  if  dp(s)  and  dr(s)  ire  coprime.  In  this  case  let  upr(s)  and  ei^(r) 
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be  suble  functions  such  that 

uprls)dpls)  +  i/pf(s)rf,(s)  -  l 

and  let  bis)  m  nb(s)/db(s )  be  a  coprime  fractional  representation  of 
b(s)  *  np  ( s)  !dr(s) .  Then  the  desired  set  of  compensators  take  the  form 

,  „  [  -  *ls)d  is)  +  u  Is)  J 

els)  m  — ■ - c - c - 

[w{s)nf(s)  +  vp(s)\ 

where 

w(s)  ■  upfls)upls)  +  fls)dbls) 

with  f(s)  an  arbitrary  suble  function  such  that  w (s)np(s)  +  vp(s)  is  not 
identically  zero. 

14.  Example.  Continuing  with  the  plant  of  Example  12  let  us  consider  the 
problem  of  designing  a  compensator  to  reject  a  step  function,  i.e.,  we  let 


1 

fil  .  .  <*+2>. 

UJ  S  drls) 
.(*+2). 


H  [sTi]  *  H  [i777,]  -  * 1  «*•» 

showing  that  dp(s)  and  dr(s)  are  coprime.  Finally,  we  let 

iilii 

«Js)  ls+2)1  "tW 

bis)  -  .2—  -  y'—-’  -  J—  (4.4) 

dtls)  s  dbls) 

.  <*+2)  . 

which  is  clearly  coprime.  From  the  theorem  the  required  w(j)  take  the  form 

»(r)  *  u^Wu^s)  +fls)dbls)  •  j~”j  +  [(777)  ^ s) 

where /(s)  is  arbitrary  stable  function  such  that  w[s)np(s)  +  vp{s)  is  not 
identically  zero.  The  use  of  the  compensator  derived  from  this  w(s)  then 
results  in  the  gains 

j(r  + 1)]  32 

hr  (#)  -  - -  —  (j+2)-(i-2)/(j)  (4.4) 

'  .(r  +  2)\  .3  . 
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and 


^tjHj  (i) 


( s-2 ) 
.Hr+2)4. 


l(x+2X3xJ-Si-12)  «■  3x(x+  l)/(x)  1 


(4.7) 


Here,  the  fact  that  (x)  has  a  zero  at  s  *  0  indicates  that  the  disturbance 
rejection  specification  us  been  achieved  while  the  fact  that  h,  (x)  has  a 
zero  us  m2  implies  that  the  system  also  tracks  «*U  (f).This  Is  consistent 
with  Example  12  for  the  system  tracks  e^Uf/). 


S.  Simultaneous  traddaf  and  dlstnrbance  rejectioa 

The  purpose  of  this  section  is  to  combine  the  results  of  the  previous  two  sec* 
tions  by  formulating  criteria  for  the  design  of  a  compensator  which 
simultaneously  stabilizes  the  system,  causes  it  to  track  the  impulse  response 
of  Us)  and  causes  it  to  reject  the  impulse  response  of  r{s)  [16].  The  ap* 
propriatc  feedback  system  configuration  is  shown  in  Figured  where  r{s)  is 
taken  to  be  an  input  disturbance.  Of  course,  an  output  disturbance  can  also 
be  included  in  the  theory  simply  by  combining  it  with  the  tracking 
generator. 


Flgmt  4.  Configuration  for  the  simultaneous  tracking  and  disturbance  rejection 
problem. 


For  consistency  with  the  previous  sections  we  will  use  the  same  notation 
which  is  reviewed  as  follows.  Our  plant  is  assumed  to  have  a  coprune  frac¬ 
tional  represention 

"As) 

p{t)  m  -2 —  (5.1) 

d,ls) 

such  that 


»;(i)s,(x)  +  ii,(x|^(x|  ■  1  (5.2) 

while  the  tracking  and  rejection  generators  are  characterized  by  coprime 
fractional  representations 
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and 


r(s)  - 


«,(*> 

d,(s) 


Also  as  in  the  previous  sections  we  define  a  is)  and  bis)  by 


sis) 


d/s)  n/s) 
d/s)  d/s) 


and 


bis) 


d/s)  n/s) 
d/s)  "  d/s) 


(5.4) 

(5.5) 

(5.5) 


where  a(s)  »  n/s) /n/s)  and  b(s)  «  n/s) /d/s)  are  coprime  in  the 
sense  that  there  exists  stable  u#(s),  v/s).  u/s),  and  vb(s)  such  that 

u/s)n/s) +v/s)  d/s)  •  i  (5.7) 


and 


u/s)  n/s)  +  v/s)  d/s)  -  1  (5.8) 

Moreover  it  follows  from  S.5  and  5.6  together  with  property  2  that  there 
exists  stable  k(s)  and  mis)  such  that 

d,is)  -  d/s)kis)  and  d/s)  -  n/s)kis)  "  (5.8) 

and 

d/s)  m  d/s)m{s)  and  n/s)  -  n/s)m{s )  (5.10) 

Finally,  the  coprimeness  conditions  for  the  tracking  and  disturbance  rejec¬ 
tion  problems  are  characterized  by 


uplis)n/s)-t-vplis)d/s)  -  1 

(5.11) 

and 

upris)d/s)  +  vpr{s)d/s)  m  i 

(5.12) 

while  we  will  also  require  a  coprimeness  condition  between  d/s)  and  d/s) 
which  we  characterize  by  the  equation 

u/s) d/s)  +  v/s)d/s)  m  i 

(5.13) 

With  this  review  of  notation  in  hand  the  required  design  equations  for 
the  simultaneous  tracking  and  disturbance  rejection  problem  can  be  ob¬ 
tained  simply  by  combining  the  results  of  Property  11  and  Property  13  and 
observing  that  both  design  equations  must  be  satisfied  by  the  same  w(s) 
since  we  desire  to  construct  a  single  compensator  which  simultaneously 
solves  both  problems. 
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IS.  Property.  Given  p{s)  there  exists  a  compensator  for  the  feedback 
system  of  Figure  4  which  stabilizes  the  system,  causes  it  to  track  the  impulse 
response  of  t{s)  and  simultaneously  causes  it  to  reject  the  impulse  response 
of  r(s)  If  and  only  if  the  pair  of  equations 

w(x)*#(x)d#(x>  ♦x(f)d,(r)  ■  e,(r)*,(x)  - 1 
and 


*is)nfls)d0(s)  +/U)rf,(i)  «k,(x)*#(x) 

admit  stable  solutions  w(s).x(s).  and  y(s)  such  that  w{s)np(s)  +  o (r)  is 
not  identically  zero.  In  this  case  Use  required  compensator  takes  the  form 

I  -w(x)d.(x)  +11  (x)J 

«<x)  - - 1 - 1 - 

I"  (r  J 

where  w(x)  is  a  solution  to  the  above  equations. 

Simultaneous  Tracking  and  Disturbance  Rejection  Theorem:  Given 
pis)  there  exists  a  compensator  for  the  feedback  system  of  Figure  4 
which  stabilizes  the  system,  causes  it  to  track  the  impulse  response  of 
Us)  and  simultaneously  causes  it  to  reject  the  impulse  response  of r(s)  if 
and  only  if  " 

0)  n  As)  and  d,(s)  are  coprime, 

(U)  a  As)  and  dr(s)  are  coprime,  and 
Qii)  atis)  and  dt{s)  are  coprime. 

In  that  case  the  desired  set  of  compensators  take  the  form 

I-w  (x)dL(x)+v  (x)] 

c(r)  - - 2 ' - 

(w<s)i»,<x)  +  e#(s)J 

where 

w(s)  •  (« »0r(s)uf(s)ulr(s)dl(s) -um{s)v,(s)o„{s)dr(s))+t{s)db{s)dais) 

with  gis)  an  arbitrary  stable  function  such  that  w (r)n#(r)  +  v0(s)  is  not 
identically  zero. 

Proof.  The  fact  that  n0(s)  and  d,(s)  must  be  co prime  follows  from  the 
tracking  theorem  while  the  fact  that  dp(s)  and  dr(s)  must  be  coprime  follows 
from  the  disturbance  rejection  theorem.  To  verify  that  d,(s)  and  dr[s)  must 
also  be  coprime  for  simultaneous  stabilization  we  subtract  the  two  design 
equations  of  Property  15  obtaining 

l r(s))d,ls)  +  -  I  (5.14) 

Conversely,  to  show  that  the  three  coprimeness  conditions  are  also  suffi¬ 
cient  we  must  construct  a  w(r)  which  simultaneously  satisfies  the  criteria  for 
tracking  and  disturbance  rejections  derived  in  the  preceeding  sections. 
Upon  invoking  the  results  of  the  tracking  and  disturbance  rejection 
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theorems  we  must  therefore  solve 

”"*»(*>* :pis)+e(s)dt{s)  -  w(r)  ■  v^(s)m/,(s)  +/(s)db(s)  (5.15) 

for  stable  *(s),f(s),  aod  w(s).  Since  the  required  set  of  stable  w(r)  may  be  ob¬ 
tained  by  substitution  once  e  (s)  and/(s)  have  been  parameterized  our  main 
problem  is  to  characterize  the  stable  solutions  of 

*(s)da(s)  ~/(s)dp(s)  -  (iy(s)v,(s)  +upl(s)vpis)  ]  (5.16) 

To  obtain  a  particular  solution  for  5.16  we  invoke  5.9,  5.10,  and  5.13 
obtaining 

lup,{s)up(s)  +upl(s)vpls)  ] 

»  lupr(s)up{s)  +  npl(s)op(s) )  ( utr(s)d,(s)  +  vu(s)d,(s)  J 

*  (Vp,(f)«p(r)  +  w„(i)v,(*)  lT«,r(r)*(s)d,(s)  +  v,rls)m(s)di(.s)  1 

*  (l "prWUpls)  +upl(s)vp(s)]ulr(s)k(s))dt{s) 

+  [lypr(s)yp{s)*uplls)vp{s))vlr{s)m(s)}dbls)  (5.17) 

As  such,  the  required  particular  solutions  take  the  form 

«*(*)  »  l»»pr(s)up{s)-¥uptU)vp{s))uv(s)k(s)  (5.1*) 

and 


.Of  course, 

and 


/»(s)  «  -  +upt(s)vp(s)]vlr(s)m(s) 

**(»)  ■  g(s)db{s) 


(5.19) 

(5.20) 


/*(*)  -  tis)da[s)  (5.21) 

represent  homogeneous  solutions  to  5.16  for  any  stable  g{s)  since 

e*(j)</,(j) -/*(s)d4(s)  -  g{s)dbls)dt[s)-g(s)dt(s)db(s)  -0  (5.22) 

As  such,  our  characterization  of  the  solution  space  for  5. 16  will  be  complete 
if  we  can  verify  that  all  stable  homogeneous  solutions  to  5.16  take  the  form 
of  5.20  and  5.21  for  some  stable  g(s).  To  this  end  Iet_e*(s)  and /*(z)  be  ar¬ 
bitrary  stable  homogeneous  solutions  to  5.16,  i.e.,  “ 

j*(j)tf4(r)  -/*(s)dt(r)  -  0  (5.23) 

Now,  define  g(s)  by  g(s)  =  e*(s)  /db(s)  in  which  case 

.£*(*)  *  g(s)db(s) 


(5.24) 


while  3.23  implies  that 


e*{s)dtls) 


*  *(x)d,( x) 


showing  that_e*(j)  and/* (5)  are  of  the  required  form.  It  remains,  however, 
to  show  that  t(s)  is  stable.  Indeed, 

,  %  **(*)  **<*> 

*<*>  *  T7T  *  T - l**W«gCe>  ♦*(#)**(!)  1 

■*w  «>U) 


j*(x)itA(x)e»(x) 


«*(X)M,(x)fl (S) 

■  .£*<*) »*<x)  ♦ - - - 1 —  lw„{x)d,(x)  +  »„{s)d,(s) ) 

«,U). 


•j*(s)vt(s)  +**<s)t‘t(s)af(s)ulr(s)  + 
*  «£*<*)"*<*)  +e*(x)Kt(x)fl/(x)vlr(s)  + 


.£*(*)  (x)  (x)  ( x)  d,  (s) 

d^x) 

/  (x)  u^(s)nb(s)  u^fsid^isiJtd) 
______ 


«_x*(x)»t(x)  +£*<x)i»,<x)*,<x)e„Cx)  +/* (X) (x) %(x) u^x) k (x)  CS.20 

showing  that  g(x)  is  stable  since  it  has  been  expressed  as  the  sum  of  prod* 
ucts  of  stable  functions.  Here,  Equation  3.26  was  derived  with  the  aid  of 
Equations  3.6,  5.S,  5.9,  5.13,  and  5.23.  As  such,  the  complete  set  of  solu¬ 
tions  to  5.16  take  the  form 

•is)  m  [«/„(j)«p(x)  +*»p,(*)up(x}  jv^dXrtx)  +*(x)dk(x)  (S^7) 


/(x)  -  [v(x)u,(s)  +  v„(x)ii»,(x)]  vlr(s)m(s)  +g(s)d,(s)  <5J») 

Now,  upon  substituting  either  of  these  expressions  into  5.15  we  obtain  the 
desired  expression  for  w(x)  in  the  form 


w(s)  rn  -um(t)vf{s)  +upr(s)up{s)  +^{x)B#(x)u„(x)*(x)d#(x)  +g(x)dj>(x)d#(x) 

*  y0ls)up(s)v„(s)dt(s)  +U0(s)vp(s)vu(r)dl(s)  -  1  ♦  g(x)dk(x)d,(x) 

•  \''prl*)>‘pi*)ulrls)df{s) -"0W»pls)v„is)drls)]+ns)db(s)dals)  <5.29) 


Fecosace  System  Design 


where  g  (s)  is  an  arbitrary  function. 

Finally,  to  complete  the  proof  that  the  three  coprimeness  conditions  suffice 
for  simultaneous  tracking  and  distrubance  rejection  we  must  verify  that  there 
exists  one  choice  of  a  stable  g{s)  such  that  w(s)/>,U)  +  vp(s )  is  not  iden¬ 
tically  zero.  Upon  susbstituting  5.29  into  this  expression  we  obtain 

W(s)/I As)  +o As)  -  l u  is) u As) U  (s)d,ls)  -u.,(s)v  (i)o  (s)djs)]n  (s) 

'  '  *  P  '(5J0) 

+  vp(s)  +Hs)db{s)d/s)np(s) 


Now  dp(s)  and  dt(s)  are  not  identically  zero  since  they  represent  the 
denominators  for  well  defined  transfer  functions.  As  such,  if  n  As)  is  not 
identically  zero  5.30  will  be  non-trivially  dependent  on  g(s)  and  hence  not 
identically  zero  for  every  choice  of  g(s).  On  the  other  hand  if  np(s)  is  iden¬ 
tically  zero  the  equality  up(s)np(s)  +  vp(s)dp(s)  •  1  implies  that  vp(s)  is 
miniphase  hence 


*Hs)np(s)  +  vp(s)  »  vp(s)  (5-31) 

is  not  identically  zero.  Our  proof  is  therefore  complete. . 


Although  the  theorem  is  highly  complex  and,  indeed,  is  predicated  on  the 
equally  complex  theorems  which  preceded  it,  the  final  result  is  an  explicit 
description  of  the  desired  family  of  compensators.  Moreover,  the  terms  in 
this  expression  are  readily  computed  by  solving  one  or  more  coprimeness 
equation.  As  such,  the  result  is  easily  implemented  as  per  the  following  ex¬ 
ample. 


Id.  Example.  Continuing  our  analysis  of  the  plant  introduced  in  the 
previous  examples  we  will  investigate  the  possibility  of  simultaneously 
tracking  e^U  (/)  and  rejecting  U(r).  Here, 

(i-2) 

d.(s)  - -  (5.32) 

(J  +  2) 

and 


dAs)  m -  (5.33) 

(r  +  2) 

which  are  clearly  coprime.  Indeed 


- 1 


(i-2) 

l(J  +  2) 


(r  +  2)J 


utr{s)d,{s)  +  i>„(r)d,(r)  -  1  (5.34) 


As  such,  the  required  w (j)  takes  the  form 
1 


w(j) 


<r+2)JJ 


y*  +  4)  +  s(s  +  2)g{s) 


(5.35) 


Saks  rr  a l. 


yielding 

h 


Vi 


U>  .  flililf]  [il(x2+i.s+—  -  (*-2)(r+2)g(r)  (5.36) 

L(*  +  2)sJ  l  3  3  3  J 


and 


m  illjLj  ^9s*  +  12s1 +32I2-  112f -144)  +  9r(r+I)(x+2}r<x)j  (5J7) 
which  have  the  required  zeros  at  s  ■  0  and  s  «  2,  respectively. 
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Feedback  System  Design: 

The  Single -Variate  Case  -  Part  II* 

R.  Saeks1,  J.  Murray O.  Chuaz,  C.  Karmokolias 3, 
and  A.  Iyer 1 


Summary  of  Part  I.  The  present  article  represents  a  survey  of  a  new  frequency  do¬ 
main  approach  to  the  feedback  system  design  problem.  Although  the  theory  applies 
to  the  general  multivariate  case  and,  in  fact,  much  of  the  theory  can  be  extended  to  a 
general  ring  theoretic  setting  for  the  purpose  of  the  present  exposition  we  will  restrict 
ourselves  to  the  single-variate  case  wherein  the  simple  algebraic  nature  of  the  new 
theory  is  most  readily  apparent. 

In  Part  1 1  we  introduced  the  concept  of  a  stable  rational  fractional  representation 
for  a  feedback  system  and  derived  an  asymptotic  design  theory  therefrom.  This  in¬ 
cluded  a  complete  parameterization  for  the  set  of  compensators  which  stabilize  the 
system  under  appropriate  tracking  and/or  disturbance  rejection  constraints. 
Moreover,  expressions  for  the  feedback  system  gains  of  the  resulting  system  which 
are  linear  (actually  affine)  in  the  underlying  design  parameter  are  obtained.  As  such, 
the  process  of  choosing  a  specific  design  within  this  set  to  meet  additional  specifica¬ 
tions  is  greatly  simplified. 

In  Part  II  we  investigate  a  number  of  these  constraints.  These  include  the  robust 
design  problem,  transfer  function  design,  the  pole  placement  problem,  the 
simultaneous  design  problem,  and  the  problem  of  designing  a  stable  stabilizing  com¬ 
pensator. 

6.  Robust  design 

Although  the  design  algorithms  formulated  in  the  preceeding  sections 
theoretically  achieve  the  prescribed  specification,  in  practice,  they  are  often 
inapplicable  since  the  specified  plant  model  may  not  be  exact.  As  such,  we 
would  like  to  formulate  a  design  theory  which  is  robust  in  the  sense  that  the 
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prescribed  design  specifications  are  achieved  for  any  plant  in  a  “neighbor¬ 
hood”  of  the  nominal  plant  p(s).  Then,  if  pis)  is  a  “reasonable”  approx¬ 
imation  to  the  actual  plant,  our  design  formulated  in  terms  of  pis)  will  aiso 
work  for  the  actual  plant.  Although,  in  practice,  pis)  is  not  an  exact  model 
for  the  plant  it  is  reasonable  to  assume  that  c(s),  r(s),  and  r(s)  are  exact, 
indeed,  cis)  is  under  control  of  the  system  designer  while  t(s)  and  r(s)  Are 
simply  mathematical  artifices  which  model  the  signals  we  desire  to  track 
and/or  reject. 

In  various  abstract  control  theories  one  can  spend  hours  making  the  term 
“small-perturbation”  precise  f  1 8  J.  For  our  single-variate  case,  however,  it 
suffices  to  say  that  pis)  -  n^is)/dpis)  is  close  to  pis)  =  np(s)/dp(s)  if 
the  coefficients  of  n^is)  are  close  to  those  of  np{s)  and  those  of  d^is)  are 
close  to  those  of  dpis).  Indeed,  it  can  be  shown  that  this  is  a  well  defined 
concept  so  long  as  both  fractional  representations  are  coprime  [11,18]. 
Since  the  poles  and  zeros  of  a  rational  function  are  continuous  functions  of 
its  coefficients  it  then  follows  that  a  small  perturbation  of  a  miniphase  func¬ 
tion  is  miniphase  [18].  That  is,  since  both  the  poles  and  zeros  of  such  a 
function  lie  in  the  (strict)  left  half-plane  a  small  perturbation  of  its  coeffi¬ 
cients  will  not  move  them  to  the  right  half-plane. 

The  above  observation,  however,  is  sufficient  to  prove  that  any  stabiliz¬ 
ing  controller  is  robust.  Indeed,  our  stabilizing  controllers  are  designed  so 
that 


dp(s)dcis )  +  np(s)ncis)  »  1  (6.1) 

As  such,  if  the  nominal  plant,  p(s)  *  np{s)/dp{s),  is  replaced  by  the  ac¬ 
tual  plant  p(s)  *  n-{s)  /d-is)  but  we  still  use  our  design  compensator,  the 
common  denominator,  for  our  feedback  systems  gains  will  become 

dfiis)dc(s)  +  n^is)ncis)  •  k[s)  (6.2) 

which  is  still  miniphase  if  pis)  is  close  to  pis).  Thus  the  feedback  system 
will  still  be  stable  even  though  the  common  denominator  is  no  longer  the 
identity. 

17.  Property.  Every  stabilizing  compensator  is  robust. 

Unlike  the  design  of  a  simple  stabilizing  compensator  wherein  robustness 
takes  care  of  itself,  a  compensator  which  meets  a  tracking  and/or  distur¬ 
bance  rejection  specification  may  fail  to  be  robust  [11].  Interestingly, 
however,  in  our  single-variate  case  if  the  coprimeness  conditions  required 
for  the  existence  of  a  solution  to  the  tracking  and/or  disturbance  rejection 
problem  are  satisfied  then  a  robust  solution  is  guaranteed  to  exist  though 
the  set  of  robust  compensators  may  be  a  strict  subset  of  the  set  of  compen¬ 
sators  formulated  in  the  previous  sections. 

Recall  that  the  solution  of  the  tracking  problem  requires  a  stabilizing 
compensator  such  that  he)M]  (s)r(r)  is  also  stabilized.  Now,  if  we  use  the 
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actual  plant, />($)  =  np(s)/dp{s),  rather  than  the  nominal  plant  Equations 
2.9  and  3.1  yield 

-  di{s)d.(s)n,{s) 

hf]lt.(s)Hs)  *  - 2 - £ - ! -  (6.3) 

ld-(s)dc(s)  +/t-(s)ne(s)]dr(s) 


The  robust  stabilization  property  implies  that  dj(s)dc(s)  +  n^s)nc(s)  is 
miniphase  when  p(s)  is  close  to  p(s)  and  hence  6.3  will  be  stable  if  and  only 
if  the  numerator  factors  in  6.3  cancel  any  instabilities  in  dt(s).  Of  course, 
n,(s)  and  d,(s)  are  coprime  implying  that  nt(s)  will  not  help  to  cancel  the 
instabilities  in  d,(s) .  On  the  other  hand  we  cannot  use  d-(s)  to  cancel  these 
instabilities  since  it  is  not  known  exactly2.  As  such,  in  a  robust  compensator 
dc(s)  must  cancel  any  instabilities  in  dt(s).  In  our  setting  this  reduces  to  the 
requirement  that 

dc(s)  =  -x'(s)d,(s)  (6.4) 

for  some  stable  x'  (5).  Here,  the  minus  sign  has  been  assumed  for  nota- 
tional  convenience  only.  Of  course,  it  follow-s  from  the  stabilization 
theorem  that  in  every  stabilizing  compensator  dc(s)  takes  that  form 

dc(S )  a  H'(s)/Ip(s)  +  Vp(S)  (6.S) 

Upon  combining  6.4  and  6.5  we  thus  arrive  at  the  following  design  equation 
for  the  desired  robust  tracker. 

18.  Property.  Given  p(s)  there  exists  a  compensator  for  the  feedback 
system  of  Figure  2  which  robustly  stabilizes  the  system  and  simultaneously 
causes  it  to  robustly  track  the  impulse  response  of  i(s)  if  and  only  if  the 
equation 

n(s)np(s)  +x’(s)d,(s)  *  -t'pU) 

admits  stable  solutions  m  (5)  and  x'  (s)  such  that  w(s)np  +  vp(s)  is  not 
identically  zero.  In  this  case  the  required  compensator  takes  the  form 

1  -  w(s)d As)  +u  U)] 

c{s)  ~  - 2 - £ - 

[w(s)rtp(j)  +Kp(5)) 

where  w(s)  is  a  solution  of  the  above  equation. 

Note  that  if  we  multiply  the  above  design  equation  by  dp(s)  we  obtain 

**  (f)  np(s)dp(s)  +  U’ (s)dp(r)  ]d,(i)  =  -vp(s)dp(s)  *  up(s)np{s)  -  1  (6.6) 

hence  if  w(r)  satisfies  the  above  design  equation  it  also  satisfies  the  design 
equation  of  Property  11  with  x(s)  =  x'  (s)dp(s).  As  such,  our  compen¬ 
sators  for  the  robust  tracking  problem  do,  indeed,  satisfy  the  criteria 
developed  in  Section  3  for  the  tracking  problem.  On  the  other  hand  some 
of  the  w(j)’s  which  satisfy  the  design  equation  of  Property  11  may  fail  to 
satisfy  the  above  design  equation. 
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To  parameterize  the  solution  space  for  the  robust  tracking  problem  it  suf¬ 
fices  to  characterize  the  stable  solutions  of 

w(s)np(s)  -x  (s)drls)  *  -(■'pis)  (6.7) 

Indeed,  the  same  coprimeness  condition  employed  in  the  tracking  theorem 
also  suffices  for  robust  tracking  even  though  the  resultant  solution  space  is 
smaller. 

Robust  Tracking  Theorem:  Given  pis)  there  exists  a  compensator  for  the 
feedback  system  of  Figure  2  which  robustly  stabilizes  the  system  and  simul¬ 
taneously  causes  it  to  robustly  track  the  impulse  response  of  t(s)  if  and  only 
if  rtp(s)  and  d,(s)  are  coprime.  In  this  case  let  u^fs)  and  v^is)  be  stable 
functions  such  that 

up'is)Hpis)+vpi{s)d,{s)  -  1 

Then  the  desired  set  of  compensators  takes  the  form 
,  ,  (-w(j)rf  (i) +«  (s)l 


an 


(w(r)^(r)  +rp(r)] 

where 

w(j)  •  ~upl{s)ip(s)  +e'(s)d,(r) 

with  t‘  (s)  an  arbitrary  stable  function  such  that  w(s)np(s)  +  vp(s)  is  not 
identically  zero. 

Proof.  Since  any  robust  tracker  is  also  a  tracker  the  necessity  for  the  co¬ 
primeness  condition  follows  from  the  tracking  theorem.  To  show  that  the 
coprimeness  condition  is  also  sufficient  we  multiply  the  equality 

>'pl(s)*p(s)  +Vp,is)dl(s)  m  i  (6.8) 

by  - 1 ’pis)  obtaining  a  particular  solution  to  6.7  in  the  form 

w*(j)  ■  -uptis)Vp(s)  (6.9) 

and 

X’P{S)  m  -Vp,iS)VpiS)  (6.10) 

Similarly,  the  homogeneous  solutions  to  6.7  take  the  form 

w*(s)  -  e'is)d,is)  (6.11) 

and 

jt'*(j)  -  -e'is)fip(s)  (6.12) 

As  in  the  previous  derivations  one  can  use  the  coprimeness  of  np(s)  and 
d,(s)  to  verify  that  6.11  and  6.12  range  over  all  possible  homogeneous  solu¬ 
tions  of  6.7  as  e'  (s)  ranges  over  the  stable  functions.  The  set  of  w'(s)  which 
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achieves  robust  tracking  thus  takes  the  form 

m»(j)  «  -uplis)vp(s)  +  e'is)d,is)  (6.13) 

where  e'  ( s )  is  an  arbitrary  stable  function  such  that  w(s)np(s)  +  vp(s)  is 
not  identically  zero.  Also  as  in  the  previous  derivations  one  can  show  that  a 
stable  e'  ( s )  exists  such  that  w(s)np(s)  +  vp(s)  is  not  identically  zero  which 
completes  the  argument  that  the  coprimeness  condition  is  sufficient  for 
robust  tracking. 

If  one  recalls  from  equation  5.9  that  dt{s)  —  k(s)da(s)  for  some  stable 
k(s)  and  compares  6.13  with  the  corresponding  equation  for  the  tracking 
problem  we  conclude  that  the  robust  trackers  correspond  to  the  subset  of  all 
trackers  defined  by  a  stable  function,  e(s) ,  which  takes  the  form 

e(s)  *  k{s)e'  is)  (6.14) 

As  such,  we  can  construct  a  family  of  robust  trackers  simply  by  using  this 
class  of  e(s)  in  the  formulae  derived  in  Section  3.  Moreover,  since  d{s)  will 
be  a  unit  if  and  only  if  dpis)  and  dt(s)  are  coprime  we  have  the  following 
corollary. 

19.  Corollary.  Every  solution  to  the  tracking  problem  is  robust  if  and  only 
if  dp(s)  and  d,(s)  are  coprime. 

It  is  also  interesting  to  note  that  the  derivation  of  our  robust  tracker  is 
considerably  simpler  than  that  of  the  (non'robust)  tracker  since  we  do  have 
to  construct  a  coprime  fractional  representation  for  a(s)  *  dp(s)/d,(s). 
Indeed,  since  small  perturbations  onp(s)  are  assumed  dpis)  and  d,is)  will 
always  be  coprime  for  some  choices  of  pis)  and,  as  such,  the  robust  theory 
cannot  exploit  cancellations  between  the  nominal  dp{s)  and  dtis)  even 
though  they  may  not  be  coprime. 

20.  Example.  Consistent  with  Equation  6.14  one  can  construct  robust 
trackers  for  the  plant 


pis)  - 


is+l ) 
is 2  -  4) 


(6.15) 


of  Example  12  simply  by  letting  e(s)  =  k(s)e'  is)  in  those  examples.  In  the 
case  where  we  desire  to  track  a  step  function  dp(s)  and  d,is)  are  coprime 
via  Equation  3.38  and,  as  such,  the  corollary  implies  that  the  entire  set  of 
tracking  compensators  characterized  by  Equations  3.39  and  3.40  are  robust. 

On  the  other  hand  if  we  desire  to  track  e2'U  it),  dpis)  and  dtis)  are  no 
longer  coprime  as  per  Equation  3.44.  Indeed,  in  this  case 


d,is)  • 


is -2) 

-  dais)  *  k(s)dais) 

(r  +  2)J 


(6.16) 


Saeks  rr  Al¬ 
and,  as  such,  the  substitution  of  6.14  into  3.46  yields  a  set  of  robust  com¬ 
pensators  characterized  by 

‘  I6is + 2/})~'  I*  (s  —  2) 

*<*)  m  -  -  +  -  t'  is)  («».Z7) 

L  3(s  +  2)  J  L(s  +  2). 

where  e'  is)  is  an  arbitrary  stable  function. 

The  arguments  required  to  design  compensators  for  robust  disturbance 
rejection  and/or  simultaneous  robust  tracking  and  robust  disturbance  rejec¬ 
tion  follow  immediately  upon  combining  the  techniques  used  in  Sections  4 
and  3  with  the  ideas  discussed  above.  As  such,  we  sill  simply  sketch  the  ideas 
and  state  the  main  results.  For  robust  disturbance  rejection  we  require  that 


h»iH  is)r(s) 


n£{s  )ne(s)nr(s) _ 

d-is)deis)  +itj(s)ne(s)d,(s) 


be  stable  for  all  pis)  in  a  neighborhood  of  pis) .  As  with  the  tracking  prob¬ 
lem  this  will  be  achieved  over  the  entire  neighborhood  if  and  only  if  there 
exists  a  stable  y‘  (s)  such  that  ne(s)  »  y'  ( s)dt(s )  in  which  case 

"r is)  »  -  *is)dpis)  *  up{x)  •  y'  is)dris)  (6.19) 

21.  Property.  Given  pis)  there  exists  a  compensator  for  the  feedback 
system  of  Figure  3a  which  robustly  stabilizes  the  system  and  simultaneously 
causes  it  to  robustly  reject  the  impulse  response  of  r( s)  if  and  only  if  the 
equation 

wis)dpis)  +y'is)dris)  »  upis) 

admits  stable  solutions  w(s)  and  y'  is)  such  that  w{s)np{s)  +  vpis)  is  not 
identically  zero.  In  this  case  the  required  compensator  takes  the  form 

1-  w{s)a  is)  +  u  is)] 

c is)  »  - 2 - £ - 

\«is)np{s)  +vpis)\ 

where  w(j)  is  a  solution  of  the  above  equation. 

Robust  Disturbance  Rejection  Theorem:  Given  pis)  there  exists  a 
compensator  for  the  feedback  system  of  3a  which  robustly  stabilizes  the 
system,  and  simultaneously  causes  it  to  robustly  reject  the  impulse 
response  of  r(s)  if  and  only  if  dpis)  and  dris)  are  coprime.  In  this  case 
let  u^is)  and  v^is)-  be  stable  functions  such  that 


Upis)dpis)+Vpis)dris)  -  l 
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Then  the  desired  compensators  take  the  form 

I  -  w(s)  +u  (s)] 

c(j)  «  - - - 1 - 

[w(S)«p(i)  +fp(S)l 

where 


w{s)  »  upr(s)up(s)  +f(s)d,(s) 

with  f'(s)  an  arbitrary  stable  function  such  that  w (s)np(s)  +  vp(s)  is 
not  identically  zero. 

22.  Corollary.  Every  solution  to  the  disturbance  rejection  problem  is 
robust  if  and  only  if  np(s )  and  dr(s)  are  coprime. 

23.  Corollary'-  Given  p(s)  there  exists  a  compensator  for  the  feedback 
system  of  Figure  4  which  robustly  stabilizes  the  system,  causes  it  to  robustly 
track  the  impulse  response  of  t(s)  and  simultaneously  causes  it  to  robustly 
reject  the  impulse  to  r(s)  if  and  only  if  the  pair  of  equations 

w{s)np[s)  +x'(s)d,(s)  *  -vp(s) 

and 


«  up(s) 

admits  stable  solutions  w(s),  jr(s),  andy(s)  such  that  w{s)np{s)  +  t,y(s)  is 
not  identically  zero.  In  this  case  the  required  compensators  take  the  form 

(-w(s)d  (#)  +u  (s) ) 

ds)  - - Z - i - 

l  *<*)»»,(*)  +vp(s)] 

where  w(s)  is  a  solution  to  the  above  equations. 

Simultaneous  Robust  Tracking  and  Robust  Disturbance  Rejection 
Theorem:  Given  p(s)  there  exists  a  compensator  for  the  feedback 
system  of  Figure  4  which  robustly  stabilizes  the  system,  causes  it  to 
robustly  track  the  impulse  response  of  t(s)  and  simultaneously  causes  it 
to  robustly  reject  the  impulse  response  of  r{s)  if  and  only  if 
(i)  n  As)  and  d,(s)  are  coprime, 

(ij)  a  As)  and  dr{s)  are  coprime,  and 
(iii)  d,(s)  and  dr(s)  are  coprime. 

In  that  case  the  desired  set  of  compensators  take  the  form 


~w{s)d  As)  +  u(s) 

c(s)  m  - Z - £ - 

'"is)npis)  +  vp(s) 

where 

w{s)  *  luplis)up(s)ulris)d,is)-upiis)vpis)vrris)dris))  +g'is)dr{s)dtis) 
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with  g is)  an  arbitrary  stable  function  such  that  wis)np{s)  +  vpis)  is  not 
identically  zero. 

Corollary.  Every  solution  to  the  simultaneous  tracking  and  disturbance 
rejection  problem  is  robust  if  and  only  if 
Q)  dp(s)  and  d,(s)  are  coprime  and 
(ii)  np(s)  and  d,(s)  are  coprime. 

7.  Transfer  function  design 

Rather  than  asking  that  a  system  meet  a  tracking  or  disturbance  rejection 
specification  we  may  attempt  to  fully  specify  one  or  more  of  the  system 
gains  while  simultaneously  stabilizing  the  system.  For  instance  we  may  re¬ 
quire  that 

•  h(s)  (7.1) 

where  his)  is  a  prescribed  stable  transfer  function.  Of  course  Corollary  2.6 
implies  that 

“  -w(s)",is)dpls)  +up(s)npis)  (7.2) 

so  that  our  stable  design  problem  reduces  to  finding  a  stable  w(s)  such  that 
A(S)  *  -W iS)npiS)dp{t)  +UpiS)ltpiS)  (7J) 

Transfer  Function.  Design  Theorem:  Given  pis)  *  0  there  exists  a 
compensator  for  the  feedback  system  of  Figure  1  which  stabilizes  the 
system  and  yields  *V)  (j)  m  his )  for  some  stable  h(s)  if  and  only  if 
(0  np{s)  divides  h(s)  and 
(ii)  dpls)  divides  (l-A(s)l. 

Proof.  If  7.3  is  satisfied 

h{s)  m  -  wis)npis)dpls)  +Up(s)npls)  •  [  -  w{s)dpis)  +upls)  )npis)  (7.4) 
showing  that  np(s)  divides  A(s).  Similarly, 

ll -his) J  ■  "(s)*fis)dpis)-up(s)np{s)  +  l 

-  wis)npis)dpis)  +vpis)dpis)  (7J) 

■  ( w(J)ffp(J>  *  Vpis)  1  dpis) 

showing  that  dp(s)  divides  [1  -  his)  ]. 

Conversely,  since  pis)  •  0,  npis)  •  0  and  hence  the  unique  candidate 
for  w(j)  satisfying  7.3  is 


w(x) 


np{s)dpis) 


(7.6) 
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which  is  stable  under  the  hypothesis  of  the  theorem.  Indeed,  since  np(s)  and 
dp(s)  are  coprime  and  both  npis)  and  dp(s)  are  non-zero 


1  y*>  “-(*> 

-  m  — C - +  _£ - 

np(s)dp{s)  np(s)  dp(s ) 

hence 


(7.7) 


H(J) 


«ytt)  y*> 

.  "pis)  dp(s )  . 


lup{s)np(s) -his)  J 


*  vp(*)upis)  + 


vpis)h(s) 

"p(S) 


u  (s) 

+  — - [l  -h(s)  -  vp(s)dp(s)  ] 

dp(s) 


vM)h(s)  u  (5)[1 -AU)] 

»  _£ - +  _£ -  (7.8) 

np(s)  dP{s) 

which  is  stable  since  np(s)  divides  h{s)  and  dp(s)  divides  [1  -  h(s)  ]. 

Note  that  if  p(s)  is  miniphase  both  np(s)  and  dp(s)  are  miniphase,  in 
which  case  the  divisibility  conditions  of  the  theorem  hold  for  all  stable  h{s). 

25.  Corollary.  If  p(s)  is  miniphase  every  stable  A(j)  can  be  realized  as  the 
input-output  gain  of  the  feedback  system  of  Figure  1  with  a  stabilizing 
compensator. 

Of  course,  one  can  give  a  similar  argument  for  the  realization  of  any  of 
the  eight  feedback  system  gains  of  Corollary  6.  Note,  however,  that  the 
w(s )  which  simultaneously  stabilizes  the  system  and  achieves  the  prescribed 
gain  is  unique  and  hence  no  further  design  latitude  exists  once  such  a  gain 
has  been  realized. 


26.  Example.  For  the  plant 


pis)  ■ 


(r+i) 

<s2-4) 


(J+i) 

,(J+2)2. 

'  is—1) 

.  U  +  2)  . 


yj> 

dpis) 


(7.9) 


rrp(s)  is  miniphase  and  thus  divides  every  stable  h(s).  As  such,  the  only 
constraint  on  ( s )  for  a  feedback  system  built  around  this  plant  is  that 

(j-2)/(j  +  2)  divides  *►,„,($).  That  is  h,yi(s)  must  be  stable  and  have  a 
zero  at  s  »  2. 
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Of  course,  in  applications  pis)  is  only  an  approximation  to  the  actual 
plant,  pis) ,  and,  as  such,  it  behooves  us  tc  investigate  the  degree  to  which 
our  design  based  on  nominal  plant  information  remains  valid  for  the  actual 
plant.  To  this  end  we  use  the  nominal  plant  model  to  design  the  compen¬ 
sator  using  the  w(s)  of  Equation  7.8  and  then  substitute  the  resultant  nc  i s ) 
and  dc(s)  into  the  formula  for  (s)  given  in  Equation  2.10  with  the  ac¬ 
tual  plant  model  defining  n^is)  and  d-(s).  A  little  algebra  will  then  yield 


his)n-is) 

d^is)deis)  *n-is)ne(s)np(s) 


Since  (<^(r)df(r)  +  n^is)neis)  ]  is  1  whenp(s)  »  pis)  the  expression  of 
Equation  7.10  reduces  to  /r,J#1(r)  *  his)  when pis)  »  pis).  Moreover, 
since  np(s)  divides  his)  and  [d^{s)dcis)  +  nJlis)neis)  ]  is  miniphase  when 
pis)  is  close  to  pis)  is)  is  stable  when  pis),  as  is  the  feedback  system 
itself.  Finally,  since  the* inverse  of  a  miniphase  function  is  continuous  in  a 
neighborhood  of  1,  is)  is  not  independent  of  pis)  in  a  neighborhood 
of  pis).  Thus,  even  though  ti^As)  is  not  independent  of  pis)  the  system 
is  well  behaved  fo  rpis)  in  a  neighborhood  of  pi  s)  and  small  changes  in  the 
plant  will  manifest  themselves  in  smalt  changes  in  is). 

Finally,  we  observe  that  in  the  process  of  realizing  a  prescribed  input- 
output  gain  exactly  we  have  used  all  available  design  latitude.  As  an  alter¬ 
native  one  might  choose  to  only  partially  specify  (s).  Indeed,  this  can 
be  done  allowing  one  more  freedom  in  the  choice  of  w(j)  and  simul¬ 
taneously  permitting  the  constraints  on  np{s)  and  dpis)  to  be  relaxed  [3  ] . 
Although  such  a  design  can  be  implemented  by  characterizing  those  w(s) 
which  achieve  the  prescribed  partial  specifications  it  is  often  more  conve¬ 
nient  to  reformulate  the  problem  in  an  appropriate  abstract  ring  whose 
elements  achieve  the  prescribed  partial  specification.  In  particular,  the  pole 
placement  problem  can  be  resolved  by  such  an  approach  [8].  As  such,  we 
will  put  off  our  investigation  of  this  problem  to  the  concluding  section, 
wherein  it  will  be  discussed  in  the  context  of  a  ring  theoretic  generalization 
of  our  theory. 


8.  Pole  placement 

Although  the  result  of  the  previous  section  yields  a  definitive  criterion  for 
the  exact  realization  of  a  prescribed  transfer  function  while  simultaneously 
stabilizing  the  system  the  resultant  criterion  may  be  too  restrictive.  Indeed, 
for  all  practical  purposes  the  transfer  function  design  theorem  requires  that 
pis)  be  miniphaseJ.  Moreover,  in  the  case  the  compensator  is  completely 
determined  by  his)  leaving  no  further  design  latitude.  As  such,  we  prefer  to 
work  with  a  less  restrictive  pole  placement  problem  wherein  we  desire  to 
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find  a  stabilizing  compensator  such  that 


h*2»As)  - -  (8.1) 

*<*> 

where  q(s)  is  a  prescribed  Hurwitz  denominator  polynomial  and  r  (s)  is  an 
arbitrary  numerator  polynomial  which  has  no  common  factors  with  q(s) 
and  whose  order  is  less  than  or  equal  to  that  of  q(s)  (so  as  to  guarantee  that 
h. y,  ( s )  will  have  no  poles  at  infinity). 

Since  q(s)  is  Hurwitz  (for  stability)  any  common  factors  between  r(s ) 
and  q(s)  will  be  in  the  left  half  plane  and,  as  such,  they  must  be  character¬ 
ized  by  polynomial  coprimeness  criterion  rather  than  our  stable  rational 
function  criterion  which  only  characterizes  common  factors  in  the  right 
half-plane.  As  such,  we  will  use  the  term  polynomial  coprime  to  distinguish 
this  criterion  for  our  usual  stable  rational  function  coprimeness.  Consistent 
with  our  use  of  the  polynomial  coprimeness  concept  we  assume  that 


V5>  “ 


at(,s)or(s) 

m[s) 


(8.2) 


and 


bf(sW(s) 

m(s) 


<8.3) 


where  m(s )  is  a  Hurwitz  polynomial  characterizing  the  poles  of  np(s)  and 
dp(s);  ar(s)  and  tfis)  are  Hurwitz  polynomials  characterizing  the  (strict) 
left  half-plane  zeros  of  np(s)  and  dp(s) ,  respectively;  while  a'(s)  and  b'(s) 
are  anti-Hurwitz  polynomials  characterizing  the  finite  (closed)  right  half¬ 
plane  zeros  of  np(s )  and  dp(s) ,  respectively.  Since  np(s)  and  dp(s)  have  no 
poles  at  infinity 

o(m)  a  o(o0  +o(ar)  (8.4) 


and 


o(m)  zo(t/)+o(y)  (8.5) 

where  o(  )  denotes  the  order  of  the  appropriate  polynomial.  Moreover, 
since  np(s)  and  dp(s)  are  coprime  they  cannot  have  a  common  zero  at  s  =  0 
and,  as  such,  one  of  the  inequalities,  8.4  or  8.5,  must  be  satisfied  with 
equality.  Indeed,  if  we  let 

k  =  >(«')  +o<(f) -o(bl)  -oilfY,  (8.6) 

then  either 
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o(m)  •  o(fl'>  +  o(ar)  +  k  «  o(b‘)  +  o(br)  (8.8) 

Using  this  notation  we  obtain  the  following  fundamental  result. 

Property.  Given  p(s)  there  exists  a  compensator  for  the  feedback  system 
of  Figure  i  which  stabilizes  the  system  and  yields  »  r(s)/q(s) 

where  q(s)  is  a  prescribed  Hurwitz  polynomial,  r(s)  is  an  arbitrary 
polynomial  such  that  o(r)  s  o(q),  and  q(s)  and  r(s)  are  polynomial 
coprime  if  and  only  if  the  equation 

a(s)o'is)  +3(s)br(s)  •  q(s) 

admits  polynomial  solutions,  a(s)  and  8(s).  such  that 

(i)  o(a)  s  o(q)  +0(0*)  - o(m ) 

(ii)  o(i 3)  s  o(q)  +o(b ')  - o(m ) 

(iii)  a (s)  and  0(s)  are  polynomial  coprime. 

Proof.  If  there  exist  polynomials  <*(j)  and  8 (s)  satisfying 

a(r)«r(s)  +8(s)b'(s)  »  q(s)  (8.9) 

and  (i)-(iii)  we  define  w(s)  by 

B(s)m(s)  a(s)mis) 

*<*)  a  - - - U As) - - - Vp(S)  (8.10) 

q(s)y(s)  p  q(s)a\s) 

Now,  q(s)  and  b*(s)  are  both  Hurwjtz  polynomials  while  o(8m)  s  o(qb') 
via  (ii)  verifying  that  8(s)m(s)/q(s)bl(s)  is  a  stable  rational  function. 
Similarly,  the  fact  that  q(s)  and  o'(s)  are  both  Hurwitz  together  with  (i)  im¬ 
plies  that  a (s)m(s)  /q(s)at(s)  is  stable.  As  such,  w(j)  is  represented  as  the 
sum  or  products  of  stable  functions  and  is  thus  stable. 

Using  the  w(s)  of  8.10  to  define  a  compensator  for  our  system  via  the 
stabilization  theorem  we  obtain 

r(s) 

hr^(s)  *  -»(s)nAs)d  (s)+u  (s)n  (s)  • -  (8.13) 

‘  p  p  p  p  q{s) 

8(s)m(s)  a  (s)m(s) 

•  l - - - tt  As) - - - v(s)]n(s)d  As)  +u  As) n  (s) 

q(s)bl(s)  p  q(s)a*(s)  p  p 

3(s)m(s)n (s)da(s)u As)  a(s)m(s)n  (s)d  (s)v  (s) 

- - Z - Z - Z - +  _ - Z - Z - Z — +  u(s)n  As) 

q(s)br(s)  q(s)al(s) 


3(s)b'(s)nAs)uAs )  a(s)ar(s)d.(s)v„(s) 


+  V5)VJ> 


n  (s)u  (s)  a (s)a'(s)dAs)i-AS) 

[<7U>  -3(s)V(s)  1  -2 - 2 - + - 2 - Z - 

q(s)  <7(s) 
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a(s)o'(s) 

q(s) 

(x(s)a'(s) 

q(s) 

a(s)a'(s ) 
q(s) 


■np(s)up(s)  + 


q(s) 

\up(s)np(s) +vp(s)dp(s)] 


dp(s)vp(s) 


(8.11) 


To  verify  that  this  is  the  required  transfer  function  we  must  show  that  r(s) 
-  a(s)a'(s)  and  q(s)  are  polynomial  coprime  and  that  o(r)  £  o(q). 
Now,  it  follows  from  8.4  and  (i)  that 


o(r)  *  o(a)  +  o(ar)  £  o( a)  +o(m)  -o(a') 

£  o(q)  +o(a')  - o(m )  +o(m)  -0(0')  =  o{q) 


(8.12) 


verifying  that  (j)  has  no  poles  at  infinity.  On  the  other  hand  if  r(s)  = 
a(s)o'(s)  and  <?(j)  are  not  polynomial  coprime,  say  they  have  a  common 
zero  air  s  jg,  then  8.9  implies  that  0(s)lf(s)  also  has  a  zero  at  So-  Moreover, 
since  q(s)  is  Hurwitz,  Sq  must  lie  in  the  strict  left  half-plane  implying  that  it 
is  actually  a  common  zero  of  a  (5)  and  0(s )  since  the  zeros  of  </(*)  and 
br(s)  are  in  the  right  half-plane.  This  is,  however,  impossible  since  a(s) 
and  0(s)  are  assumed  to  be  polynomial  coprime.  As  such,  r(s)  and  q(s) 
are  polynomial  coprime  completing  our  sufficiency  proof. 

To  verify  the  necessity  of  conditions  (i)-(iii)  assume  that  there  exists  a 
stable  w(s)  such  that 


*r2B,(s)  =  -  w(s)np(s)dp(s)  +  up(s)np(s) 


r(s) 

q(s) 


(8.13) 


Now,  substitution  of  8.2  and  8.3  into  8.13  implies  that 


up(s)  *  w(s)dp(s)  + 


r(s)m(s) 

q(s)a'(s)ar(s) 


(8.14) 


but  since  up(s)  is  stable  while  m(s),  q(s)  and  ar(s)  are  Hurwitz  the  anti- 
Hurwitz  polynomial  a'(s),  in  8.14  must  be  cancelled  by  a  corresponding 
factor  in  r(s).  As  such  8.14  implies  that  there  exists  a  polynomial  a(s)  such 
that 

r(s)  =  a(j)a'(s)  (8.15) 

Moreover,  since  up(s )  is  stable  and  given  by  up(s)  =  r(s)m(s)  /q(s)a‘(s) 
o(a)  £  o(q)  +o(a')  -o(m)  (8.16) 
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verifying  (i).  Similarly;  ii  follows  from  the  equality  up{s)np{s) 
+  vp(s)dp(s)  *  1  together  with  8.13  and  8.15  that 


vp(s)  +  »(x)/»pU) 


(<7<f)  -a(x)tf'(.r)  ]m(s) 
<?(s)y(s)t>‘(s) 


As  before,  since  vp(s)  +  w(s)np(s)  is  stable  while  q(s).  &(*)  and  m(j)  are 
Hurwitz  it  follows  that  there  exists  a  polynomial  $(s)  such  that 

(q(s) -a(s)a'(s)}  «  Bis)b'(s)  (8. IS) 

which  is  the  desired  equality.  Furthermore,  since 


rp(x)  +  *is)np(s) 


S(s)mls) 

q(s)bt(s) 


is  stable  it  follows  that 


o<3)  s  o(q)  +  c(b/) -o(m)  (8.20) 

verifying  (ii).  Finally,  if  s0  were  a  common  zero  of  a  (s)  and  0(s)  it  would 
also  be  a  zero  of  r(s)  »  ais)a'(s)  and  qU)  »  a(s)a'ls)  +  (3(s)br(s) 
which  is  impossible  since  r(s)  and  <7(5)  are  assumed  to  be  polynomial 
coprime.  As  such,  a(j)  and  3(s)  are  polynomial  coprime  verifying  (iii)  and 
completing  the  proof. 

Consistent  with  Property  27  the  solution  of  our  pole  placement  problem 
reduces  to  the  solution  of  the  linear  polynomial  equation 

als)a'(s)  +B(s.)b'{s)  m  q(s)  (8.21) 

under  constraints  (i)-(iii)  of  Property  27.  Now,  since  np(s)  and  dp(s )  are 
coprime  they  have  no  common  right  half-plane  zeros  hence  a'(s)  and  b'(s) 
are  polynomial  coprime  which  implies  that  8.21  is  soluable.  It  is  not, 
however,  clear  that  8.21  represents  a  solution  satisfying  constraints 
(i)-(iii)  of  Property  27.  Indeed,  8.21  represents  o(q)  +1  equations  in 
o(a)  +0(8)  4-2  unknowns  hence  for  a  solution  we  require  that 

ofq')  +  l  s  o(a) +0(5)  *2  (8.22) 

while  constraints  (i)  and  (ii)  together  with  8.7  (or  8.8)  imply  that 
o(q)  + 1  s  (o(q) +o(of) -o(o») +o(^) +o(bf) -o(m)  1 +2 


or  equivalently 


1 2 o(q)  -o(a')  - o(br )  -k-r-2 


o(q)  2  [o’(a')  *  o(b')  +  *1  -  1 
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As  such.  Equation  8.24  yields  a  necessary  and  sufficient  condition  for  the 
linear  equation  8.21  to  be  generically  solvable.  Indeed,  the  polynomial 
coprimeness  of  ar(s)  and  br(s)  implies  that  8.21  will  always  admit  coprime 
solutions  when  8.24  is  satisfied  (though  the  equation  may  also  admit  solu¬ 
tions  which  are  not  polynomial  coprime).  On  the  other  hand  if  8.24  is  not 
satisfied  8.21  is  generically  unsolvable  though  it  may  admit  solutions  for 
certain  qis)  which  lie  in  an  appropriately  lower  dimensional  subspace. 

Finally,  let  us  denote  the  integer  [o(ar)  +  o(br)  +  *]  by  rip)  and 
observe  that  it  represents  the  total  number  of  (closed)  right  half-plane  poles 
and  zeros  of  pis).  Indeed,  o(a')  is  precisely  the  number  of  finite  closed 
right  half-plane  zeros  of  pis)  while  oibr)  is  the  number  of  finite  poles  of 
pis)  and  k  represents  the  number  of  poles  or  zeros  at  infinity  (depending  on 
whether  8.7  or  8.8  holds).  As  such,  rip)  is  a  natural  measure  of  the  degree 
to  which  pis)  fails  to  be  miniphase.  Consistent  with  the  above  we  have 
verified  the  following. 

Pole  Placement  Theorem:  Given  pis)  there  exists  a  compensator  for 

the  feedback  system  of  Figure  1  which  stabilizes  the  system  and  yields 


hfmH.{S)  S  “ 

qis) 

where  <7(5)  is  a  prescribed  Hurwitz  polynomial,  ris)  is  an  arbitrary 
polynomial  such  that  oir)  s  oiq),  and  qis)  and  ris)  are  polynomial 
coprime  if 

oiq )  !£  rip)  -  l 

Conversely,  if  oiq)  <  rip)  -  1  the  pole  placement  problem  is  generi¬ 
cally  unsolvable. 


Finally,  we  consider  the  problem  of  parameterizing  the  solution  space  for 
the  pole  placement  problem.  Indeed,  it  follows  from  Property  27  that  the 
space  of  compensators  which  resolve  the  pole  placement  problem  for  a 
given  qis)  take  the  form 


cis)  = 


j  -  wjs)dpis)  -*•  Up (5)  ] 
(»'(j)np(j)  +  ip(s)] 


(8.2S) 


where  w(j)  is  given  by  8.10  with  a  is)  and  (3  is)  chosen  to  satisfy  8.21  under 
constraints  (i)-(iii)  of  Property  27  together  with  the  condition  that 
u'(5)np(j)  +vpis)  is  not  identically  zero.  Now,  if  o^(s)  and  2pis)  repre¬ 
sent  a  particular  solution  to  8.21  it  follows  from  the  polynomial 
coprimeness  of  a'(s)  and  b'is)  together  with  the  usual  arguments  that  the 
entire  solution  space  for  8.21  takes  the  form 


q(j)  =  arts)  +jis)b'is) 


(8.26) 
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d(r)  *  3Hs)-j(s)a'(s)  (8.27) 

where  j[s)  is  an  arbitrary  polynomial.  In  our  case,  however,  in  order  to 
satisfy  constraints  (i)  and  (ii)  of  Property  27  we  assume  that  ap(s)  and 
&>(s)  satisfy  these  constraints  in  which  case  we  may  guarantee  that  oris) 
and  3(s)  will  also  satisfy  constraints  (i)  and  (ii)  by  requiring  that 

o(j)  s  o(q)  -x(p)  (8.28) 

(which  faa  follows  from  Equation  8.7  and  8.8)4.  In  general,  however,  these 
solutions  may  or  may  not  be  polynomial  coprime.  As  such,  we  may 
parameterize  the  solution  space  for  the  pole  placement  problem  via: 

28.  Corollary'.  Let  p(s)  and  q(s)  be  given  and  let  a'U)  and  fip(s)  be 
solutions  of  Equation  8.21  such  that  o(a)  s  o(q)  +  o(of)  -b(m)  and 
o(S)  £  o(q)  +o(6')  -o(m).  Then  the  set  of  compensators  which  resolve 
the  pole  placement  problem  take  the  form 

t-w<j)d  (r)  +  ua(s)l 


(**(*)»,{*)  +w,(r)J 


where 


[ffris)  -»v(s)a'(j)]m(j) 
lo(s)bf<s)] 


up{s)  - 


[«*(*)  +j(s)6'(s)lm(s) 
[q(r)«'(r)l 


where y(r)  is  an  arbitrary  polynomial  such  that  o(J)  *  o(q)  -*(P)  and 
j(s)  is  chosen  so  that  a(s)  ■  a*(s)  +j{s)br{s)  and  3(r)  *  3p{s) 
- j{s)a'(s )  are  polynomial  coprime  and  w(s)np(s)  +  op(s)  is  not  iden¬ 
tically  zero. 

29.  Example.  Once  again  consider  our  usual  example  with  plant 

<s  +  1) 

p(S)  «  -  .  ■  (8.29) 

UJ-4) 

though  in  this  case  since  the  formulation  for  the  pole  placement  problem  re¬ 
quires  that  np(s)  and  d.(s)  have  a  common  denominator,  m(s),  we  will 
work  with  the  fractional  representation 


[  (J+D  ’ 

r 

i  — ! 

L(r  +  3)=J 


IaUJ 


.v.v.v 
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which  is  coprime  since 


(6  +  13)' 

r  (5+D  ■ 

.  (5+  1)  j 

l(5  +  3)2. 

(s2  -  4)1 

-Vs>nP(s>+vp(sK(s) 


1  (8.31) 


As  such,  for  this  example  m(s)  =  (s  +  3)2,  a'(s)  =  ( s+  1),  <7'(s)  =  1, 
bc(s )  =  (5  +  2),  and  br(s)  =  { s-2 )  while  r(p)  =  2.  According  to  the 
theorem  we  may  thus  realize  any  Hurwitz  denomination  of  order  greater 
than  or  equal  to  ! . 

Now%  consider  the  case  where  we  let  q(s)  =  s2  +  s+  1  in  which  case  we 
require  that 

o(a)  s  o(q)  +o(al) -o(m)  =  I  (8.32) 

and 


o(0)  s  o(q)  +o(b‘) -o(m)  =  1  (8.33) 

As  such,  8.21  is  satisfied  by 

1(35+ 1)1  ( 1  ]  +  [r]  [(5-2)]  -  a(s)a'(s) +0(s)b'(s)  =  q(s)  =  52  +  5+1  (8.34) 

where  a(s)  and  0(s)  are  both  first  order  and  polynomial  coprime.  Now,  if 
we  takey  (s)  »  0  in  Corollary  28  we  obtain 


**•(*) 


c(s)  = 


(j+3)?(3r*  +  &?-2) 

U2  +  *+lXr+lX*+2)J 

3(s  +  2)(-J3  +  2j2+  115  +  3) 
s(5+l)(s  +  3)2 


(8.35) 


(8.36) 


and 


3(5+  1) 

(52  +  5+l) 


a(s)ar(s) 

q(s) 


(8.37) 


as  required. 


9.  Simultaneous  stabilization 

Although  a  robust  design  will  remain  valid  under  small  plant  perturbations 
one  often  desires  to  design  a  compensator  which  will  meet  specifications 
over  a  wide  range  of  plants.  For  instance,  if  the  plant  contains  unknown  or 
variable  parameters;  say  corresponding  to  temperature,  altitude  or  load 
changes;  one  might  wish  to  design  a  compensator  which  meets  specifica¬ 
tions  independently  of  the  unknown  parameters.  Alternatively,  it  may  be  re¬ 
quired  to  design  a  compensator  for  a  multi-mode  plant  which  achieves  the 
prescribed  design  specifications  in  each  mode;  say,  a  two  speed  motor.  This 
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problem  which  has  received  renewed  attention  in  recent  years  is  termed  the 
simultaneous  design  problem  [7,17].  in  the  case  of  the  simultaneous 
stabilization  problem  a  complete  but  highly  abstract  theory  exists. 

In  the  present  section,  rather  than  summarizing  this  abstract  theory,  we 
will  restrict  ourselves  to  the  one  special  case  for  which  a  simple  frequency 
domain  theory  exists -the  simultaneous  stabilization  of  two  plants  [7]. 
Here,  we  asstime  that  two  d;stinct  plants,  pis)  *  pis),  are  given  and  we 
desire  tc  find  a  single  compensator,  c(x),  which  simultaneously  stabilizes 
both  plants.  We  therefore  assume  that  bothp(j)  and  p(s)  are  characterized 
by  coprime  fractional  representations 

nas) 

pis)  -  -£ —  (9.1) 

dpis) 

and 


where 


Pis) 


"pis) 

dpis) 


(9-2) 


and 


Ujis)n}is)  +Vjis)djis)  •  1 


(94) 


UfiiS)HfiiS)  +  Vjis)dfiis)  -  1  (9.4) 

for  appropriate  stable  functions  upis),  vpis),  Upis),  and  v^is). 

Now  it  follows  from  the  stabilization  theorem  that  the  stabilizing  com¬ 
pensators  for  the  two  plants  are  given  by 


[  -w<r)rf.(r)  +k-(j)  J 

c(j)  - - 2 - 2 - 

(w(s)*-(s)  +^(J)1 

and 


(94) 


[  -w(s)di(s) 

cis)  -  - 2 - 2 -  (9.6) 


where  w(s)  and  w(s)  are  stable  functions  such  that  *(s)n-(s)  +  v-(s)  and 
w(r)«pU)  +  Uj(s)  are  not  identically  zero.  In  our  application,  however,  we 
desire  tc  construct  a  single  compensator,  c  is) ,  for  both  plants  and  hence  we 
must  equate  9.5  and  9.6.  Recalling  that  the  above  coprime  fractional 
representations  for  the  compensator  are  unique  up  to  a  miniphase  factor 
(via  Property  1)  we  equate  the  numerators  and  denominators  of  9.5  and  9.6 
modulo  a  miniphase  factor,  mis),  obtaining 

~w(s)d-(s)  +  u-(j)  «  mis)  [ -w(s)d-is)  +  u-(5)  ]  (9.7) 

P  P  p  P 
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»(s)n-(.s)  +  v-(s)  =  m(s)  (w(s)rt^(s)  +  v-ls)  ]  (9.S) 

which  must  be  solved  for  stable  w(s)  and  w(s),  and  miniphase  m(s).  Re¬ 
arranging  the  above  equations  yields  the  matrix  equality 


f-d-(s)  d-(s)  1  w{s) 


m(s)u-(s)  -u-(s) 


L  n-(s)  n^s)  J  mls)w(s)  J  -  v-(s) 

Solving  8.9  via  Cramers  rule  then  yields 


m(s)w(s)  J  Inj(s)d^s) -/tj(s)dj(s)) 


‘  m(s)  -  [n^s)u-(s)  +  d^$)v^S)  1 
_m(j)  [h^(s)m^(s)  +d-(s)v^(s)]  -  1 


As  such,  we  must  construct  a  miniphase  m(s)  such  that 
fm(j)  -  [nAs)u-(s)  +di(s)t;£(f)}l 

*(j)  * - £ - £ - £ - £ - 

l"}U)dpis)  -n^s)d-(s) ) 


w(5)  = 


[n^s)u6(s)  +di{s)vi{s) )-" 
\n.(s)d-{s)-n.(s)d.(s)) 


are  stable. 

Since  both  the  numerator  and  denominator  in  the  above  expressions  are 
stable  w(s)  and  w(s)  will  be  stable  if  and  only  if  the  numerators  in  9.1 1  and 
'9.12  are  chosen  to  cancel  any  right  half-plane  zeros  in  [n-(s)d-(s) 
-n-(s)djis)].  That  is,  if 

-np{si)dp<s,)  m°  <9-13) 

for  some  right  half-plane  s(.  we  must  choose  m(s,)  such  that 

m(i;)  -  [HjisJUfils,)  +d-(s,)  +  V-(S')  ]  -  0  (9.14) 


+dfisi)*pty )  -  -^77  *  0  (9-15> 

While  these  may,  at  first,  appear  to  be  contradictory  requirements  the 
validity  of  9.13  and  9. 14  automatically  implies  that  of  9.15,  and,  as  such,  we 
may  simply  define  m(s,)  by  9.14  at  those  values  of  s,  where  9.13  holds.  Al¬ 
though  this  is  true  with  complete  generality  to  simplify  our  arguments  we 
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will  assume  that  d^{s)  and  d^(s)  are  coprime;  i.e.,  they  have  no  common 
(closed)  half-plane  zeros. 


30.  Lemma.  If  et^(s)  and  d-(s)  are  coptime  then  9.14  and  9.13  are 
uniquely  satisfied  by 


«(*,) 


W 


whenever  9.13  holds  for  r,  in  the  (closed)  right  half-plane. 


Proof.  First  let  us  show  that  under  our  coprimeness  assumption  both 
d-(st)  and  d-is^  are  non-zero.  If  d-(s)  «  0  then  9.13  implies  that 
n-is^d^Sj)  ■  0  which  is  impossible  since  both  n-(s)  and  d^{s)  are 
coprime  with  respect  to  d-(s).  Thus,  d-(st)  *  0  while  a  similar  argument 
applies  to  dAst).  Now,  substituting 


m  <s() 

into  9.14  together  with  9.13  yields 


W 


"i(Sj)  -  [n^sju^s,)  +di(si)v.(si)] 

-  d'lSi)  -  nilt‘)dilsi)up{si)  , 

djltj)  d-d,)  d-(st) 


dp(*,>  dpw 


+  v^Sj)d}(  s,)  1 


W  di(s<) 


via  9.3.  Similarly, 


I 


2£i^pWjW  t  di{ti)diis,)viisi)  d-B(s,) 

dpW  dpW  W 


d  AS:)  di(S.) 


p'  ' ' 
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«  W  m0 

^(*i)  rfp(s.) 


via  8.4 


(9.18) 


Note  that  the  lemma  remains  valid  even  without  the  coprimeness  assump¬ 
tions  on  dB(s)  and  d6(s)  though  if  both  of  these  functions  have  a  common 
right  half-plane  zero  at  s,  one  must  take  rrt(s,)  *  np(s,)ut(s,). 

Consistent  with  the  above  and  retaining  our  assumption  that  d^(s)  and 
dp (s)  are  coprime  the  resolution  of  our  simultaneous  stabilization  problem 
reduces  to  the  construction  of  a  miniphase  function  m(s),  such  that 


m(s)  m  dP.[Si)  (9.19) 

W 

at  each  closed  right  half-plane  zero  of  n-ls)d-(s)  -  n-(s)d-(s) .  Of  course, 
since  all  of  our  rational  functions  have  real  coefficients  the  set  of  points 
(Sf.ntg)  which  we  must  interpolate  is  symmetric  in  the  sense  that  (Sjtfh.)  is 
in  the  set  whenever  is  in  the  set  where  the  “overbar”  notation  in¬ 

dicates  complex  conjugation.  The  required  miniphase  interpolation  lemma 
was  originally  derived  by  Youla  [25  ]  in  the  context  of  his  study  of  the  stable 
stabilization  problem  and  is  repeated  here  without  proof. 

Youla ’s  Lemma:  Let  ($.,/»,)  be  a  finite  symmetric  set  of  complex 
2-tuples.  Then  there  exists  a  miniphase  function  with  real  coefficients  which 
interpolates  (sjtm.)  if  and  only  if  the  set  of  m(.  corresponding  to  the  J,  lying 
on  the  positive  real  axis  all  have  the  same  sign. 

Note  that  this  condition  is  extremely  weak  since  we  need  only  check  st  which- 
are  on  the  positive  real  axis.  Of  course,  since  (*,,/»,-)  is  symmetric  these  m, 
must  be  real  though  they  need  not  all  be  of  the  same  sign.  The  necessity  of 
the  condition  is  clear  for  if  two  such  mt  were  of  different  signs  the  continui¬ 
ty  of  m(s)  along  the  positive  real  axis  would  imply  that  m(s)  had  a  zero  on 
the  positive  real  axis  and  hence  it  would  not  be  miniphase.  The  sufficiency 
of  the  condition  which  is  far  less  obvious  was  verified  by  Youla  et  al.  in 
Reference  9. 

To  apply  Youla’s  lemma  to  our  interpolation  problem  we  must  check  the 
sign  of  d.(st)  /d-(Sj)  for  those  r(  which  represent  zeros  of 
n.(s)dj{s)  -n^s)d-(s)  lying. on  the  positive  real  axis  (including  the  point 
at  infinity).  Since  we  are  not  interested  in  the  value  of  d-{s,) /d^Sg)  but 
only  its  sign  the  desired  information  can  be  obtained  by  looking  at  the  zero 
crossings  of  the  functions  d^(s)  and  dp(s)  along  the  positive  real  axis.  By 
hypothesis  these  functions  have  no  common  right  half-plane  zeros  and, 
hence,  as  long  as  d-(s)  and  d^(s)  taken  together  have  an  even  number  of 


zeros  crossing  between  each  positive  reai  axis  zero  of  nfiis)dpis) 
-n^(s)df(s)  the  sign  of  d^<s)/d-is)  will  remain  constant  at  these  zeros. 
Finally,  recognizing  that  the  right  half-plane  zeros  of  d-is)  and  d-is)  are 
just  the  right  half-plane  poles  of  pis)  and  pis)  we  obtain  the  following 
theorem  [7], 

Simultaneous  Stabilization  Theorem:  Let  pis)  and  pis)  be  distinct 
plants  with  coprime  fractional  representations  pis)  *  n-pis)  !d-p{s)  and 
pis)  »  n^isMdpis)  where  d^is)  and  n^is)  are  coprime.  Tnen  there  ex¬ 
ists  a  compensator  which  simultaneously  stabilizes  both  plants  if  and 
only  if  pis)  and  pis)  taken  together  have  an  even  number  of  po;es  be¬ 
tween  every  pair  of  positive  real  axis  zeros  of  n^is)d^is)  -n^is)d^(s). 

The  conditions  for  simultaneous  stabilization  implied  by  the  theorem  are 
actually  quite  weak  in  that  more  often  than  not  rt^is)d^is)  —  rtpis)d^is) 
has  less  than  two  positive  real  axis  zeros  in  which  case  the  conditions  of  the 
theorem  are  trivially  satisfied.  Although  our  theorem  and  its  proof  is  quite 
complex  the  resultant  test  for  simultaneous  stabilization  is  extremely  simple 
as  illustrated  by  the  pole-zero  plots  of  Figure  5.  Here,  the  plot  of  Figure  5a 
corresponds  to  a  pair  of  plants  which  admit  simultaneous  stabilization  since 
there  are  an  even  number  of  poles  between  every  pair  of  positive  real  axis 
zeros.  On  the  other  hand  the  plot  of  Figure  5b  corresponds  to  a  pair  of 
plants  which  do  not  admit  simultaneous  stabilization  since  there  is  an  odd 
number  of  poles  between  one  pair  of  zeros.  Note,  in  both  of  these  examples 
the  poles  of  pis)  and  pis)  are  treated  together  and  we  do  not  have  to 
distinguish  between  the  poles  of  one  plant  and  those  of  the  other. 


a) 


■W"  - 0 - M - M- 


O - N - M - M- 


b) 


■O'  * - H - X - W- 


Figure  5.  Pole-zero  plots  for  the  simultaneous  stabilization  problem. 


Example.  Once  again  consider  the  plant 


Pis) 


!  is* })  J 
L  (S2  -  -*)  i 


(r+i) 


L(*  +  2)3J 


f  U-2) 


n^is) 


*5is) 


(9.20) 


and  let  us  attempt  to  simultaneously  stabilize  this  plant  and 


Now, 


p(s ) 


(S  +  A) 


(s  +  /4) 

(f  +  2) 

(J-l) 
(s+  2) 


»p(s) 

dpis) 


« p(*)dp(s )  -n^(s)d-p(s) 


M-2)t-(l  +  2i4) 

(f  +  2)2 


(9.21) 


(9.22) 


has  zeros  at 


(l  +  H) 
(A-  2) 


(9.23) 


and  j,*#  while  the  plants  have  poles  at  s  *  I  and  s  -  2  as  indicated  in 
Figure  6.  The  conditions  of  the  theorem  are  therefore  satisfied  iff,  >2  or  r, 
<  1 .  Indeed,  a  little  algebra  will  reveal  that 


while 


j,  >  2  if  A  >2 


(9.24) 


j,<lif-3<X<2  (9.25) 

which  correspond  to  the  two  cases  in  which  the  two  plants  may  be 
simultaneously  stabilized.  On  the  other  hand 

1  <  f,  <  2  if  A  <  -3  (9.26) 

To  complete  our  example  let  us  consider  the  case  where  A  =  3  in  which 
case  s,  «  7  and  the  two  plants  are  simultaneously  stabilizable.  Indeed,  to 
construct  the  required  compensator  we  require  a  miniphase  function,  m(s ) , 
such  that 

df)  6 

m(7)  .  .  _  (9.27) 

dpi  7)  5 


and 


m(  oo ) 


dp(») 

dp{<») 


=  1 


(9.28) 


Since  both  values  are  positive  Youla’s  lemma  guarantees  the  existence  of 


such  a  function.  Indeed, 

m(j)  »  -  (9.29) 

U+3) 

suffices.  Finally  upon  substituting  this  choice  of  mis )  into  9.1 1  the  required 
w(s)  is  obtained. 

Unlike  our  previous  design  problems  in  which  a  complete  parameteriza¬ 
tion  of  the  appropriate  family  of  compensators  was  obtained  here  we  have 
only  given  an  existence  criteria  and  specified  a  method  for  constructing  one 
such  design  when  it  exists.  Although  a  family  of  such  compensators  exists 
no  simple  parameterization  by  the  stable  function  is  known.  Indeed,  it  is  ap¬ 
parent  from  the  abstract  solution  to  the  general  simultaneous  stabilization 
problem  of  Reference  5  that  no  simple  parameterization  exists. 


Figure  4.  Positive  real  axis  pole-zero  {riot  for  Example  31. 

10.  Stable  sUMUzstkm 

Occasionally,  rather  than  simply  designing  a  compensator  to  stabilize  a 
feedback  system  we  require  that  the  compensator,  itself,  also  be  stable 
[25].  Unlike  the  general  stabilization  problem  for  which  a  solution  always 
exists  a  stable  stabilizing  compensator  may  fail  to  exist.  Fortunately,  the 
problem  of  finding  a  stable  stabilizing  compensator  is  a  special  case  of  the 
stimultaneous  stabilization  problem  and,  as  such,  we  may  derive  criteria  for 
the  existence  of  a  stable  stabilizing  compensator  from  the  simultaneous 
stabilization  theorem  [7,25]. 

52.  Lemma.  A  compensator  c(s)  is  stable  if  and  only  if  it  stabilizes  the 
zero  plant  (p(s)  ■  0). 

Proof.  This  follows  immediately  from  Equations  2.2  and  2.4  upon  ob¬ 
serving  that  when  p(s)  ■  0  the  only  non-triviai  feedback  system  gain  is 
e(s). 

Consistent  with  the  lemma  the  problem  of  finding  a  stable  stabilizing 
compensator  for  pis)  is  equivalent  to  the  simultaneous  stabilization  prob¬ 
lem  with  Pis)  ■  pis)  and  Pis)  ■  0.  Thus  upon  letting  n-is)  *  np(s), 
d-is)  ■  dp{s),  n-is)  ■  Oan d  d^(s)  ■  1  in  the  simultaneous  stabilization 
theorem  we  obtain  the  following  [7,25]. 

Stable  Stabilization  Theorem:  Given  p (5)  there  exists  a  stable  stabiliz- 
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ing  compensator  for  the  feedback  system  of  Figure  1  if  and  only  if  an 
even  number  of  poles  of  p(s)  lie  between  every  pair  of  positive  real  axis 
zeros  of  p(s). 

Of  course,  one  can  construct  a  stable  stabilizing  compensator  when  it  exists 
via  the  same  technique  used  in  the  previous  section  to  construct  a  solution  to 
the  simultaneous  stabilization  problem.  That  is,  a  miniphase  function  which 
interpolates  1  /d  (s)  at  the  (closed)  right  half-plane  zeros  of  p(s)  is  con¬ 
structed  and  used  in  Equation  9.1 1  to  construct  the  appropriate  w(s) .  Also 
as  in  the  case  of  the  simultaneous  stabilization  problem  no  parameterization 
of  the  stable  stabilizing  compensators  is  known. 

33.  Example.  Once  again  let  us  consider  the  plant 


16  f  (s+1) ' 

3  [(5  +  2)’. 


U-K2/3)'  [(5-2)' 

U  +  2)  _  L<5  +  2>. 


up(s)np(s)  +  vp(s)dp(s) 


(10.1) 


(10.2) 


Now,  the  only  positive  real  axis  zero  of  p(s)  is  at  s  *  oo  hence  the  existence 
of  a  stable  stabilizing  compensator  is  assured.  To  construct  the  required 
compensator  we  must  find  a  miniphse  function,  m(s),  such  that  m(<x)  ** 
l/efp(oo)  m  1.  Clearly 

m(s)  «  1  (10.3) 

will  suffice.  Next  we  compute  w(s)  via  equation  8.11  which  reduces  to 


W(5) 


m(s)  -vp(s) 
np{s) 


(10.4) 


for  the  stable  stabilization  problem.  Substituting  m(s),  np(s )  and  vp(s) 
from  10.2  and  10.3  into  10.4  now  yields 


4(5  +  2) 
3(5+1) 


(10.5) 


Finally,  this  value  of  w(s)  is  substituted  into  the  compensator  formula  for 
this  plant  given  in  Equation  2.37.  After  some  algebra  we  conclude  that 


(205  +  8) 

c(5)  *  - 

3(5+1) 

is  the  required  stable  stabilizing  compensator. 


(10.6) 
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11.  Optimization  of  system  performance 

The  final  step  in  the  feedback  system  design  process  is  the  use  of  any  design 
latitude  which  remains  after  all  design  constraints  have  been  achieved  to  op* 
timize  the  system  performance.  This  may  represent  z  classical  optimal  con¬ 
trol  type  of  minimization  using  a  weighted  sum  of  the  regulation  error  and 
the  norm  of  the  plant  input  or  it  may  represent  a  more  qualitative  measure 
of  system  performance;  reliability,  sensitivity,  etc.  Needless  to  say  the 
precise  parameters)  which  one  might  desire  to  optimize  is  highly  dependent 
on  the  physical  system  under  design  and,  thus,  beyond  the  realm  of  the  pre¬ 
sent  discussion. 

Whatever  parameter  one  chooses  to  optimize,  the  key  to  the  optimization 
process  lies  with  the  simple  affine  nature  of  the  feedback  system  gains  for¬ 
mulated  in  Corollary  6.  Indeed,  each  of  these  gains  is  affine  in  the  design 
parameter  w(s).  Moreover,  when  one  includes  a  tracking  or  disturbance  re¬ 
jection  constraint  th‘e  subset  of  allowed  w(s)  is  also  affine  in  the  new  design 
parameter,  say  g(s).  Since  the  composition  of  affine  functions  is  affine  the 
system  gains  remain  affine  in  the  stable  design  parameter,  i.e. 

h(s)  ■  x(s)gis)  *y{s)  (11.1) 

where  Ar(s)  is  the  appropriate  gain,  x(s)  and  y(s)  are  given  stable  func¬ 
tions,  and  g(s)  is  a  stable  design  parameter.  As  such,  the  optimization  of 
one  or  more  of  the  system  gains  and/or  the  system  responses  to  a  specified 
class  of  inputs  usually  proves  to  be  a  straightforward  process. 

In  particular,  if  one  assumes  that  and  tu  are  specified  stochastic  pro¬ 
cesses  and  desires  to  minimize  the  expected  "value  of  a  weighted  sum  of 
system  responses  a  classical  Wiener-Hopf  optimization  problem  results 
[8,23,24].  Similarly,  the  various  deterministic  optimization  problems  one 
might  choose  to  investigate  are  greatly  simplified  by  the  affine  nature  of 
10.1  which  permits  the  norm  of  any  system  response  to  be  expressed  as  a 
quadratic  function  of  jr(r) .  As  such,  these  problems  are  usually  amenable 
to  a  simple  completion  of  the  squares  solution. 

Rather  than  going  through  a  formal  optimization  process  at  this  stage  the 
system  designer  may  simply  choose  to  “optimize"  some  qualitative  charac¬ 
teristic  of  the  system;  say,  its  sensitivity.  Recall  that  the  classical  Bode  sen¬ 
sitivity  measure  for  a  feedback  system  is  just 


S(s)  *  - - -  *  A,lM.(r> 

1 +P(J)C(J) 

*  .w(s)np(s)dp(s)  -vp(s)dp(s) 


(11.2) 


hence  to  minimize  the  sensitivity  of  our  feedback  system  it  suffices  to  make 
S(s)  small  in  some  sense.  To  this  end  one  may  go  through  a  formal  op¬ 
timization  procedure;  say,  a  Chebychev  minimization  of  S(s )  over  a 
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prescribed  frequency  range;  or  simply  choose  w(s)  so  that  S  has  one  or 
more  imaginary  axis  zeros  in  the  prescribed  frequency  range.  Indeed,  this  is 
the  case  in  the  following  example. 


34.  Example.  For  the  plant 


P(s)  - 


(s+1) 
U2-  4) 


r  <£±i! 

_  [(5  +  2)^  V-> 


1  (11.4) 


L  U2-4)  J  r  is- 2)'  asu) 

where  + 

—  [ !i^>]  = 

3  l(j  +  2)*J  l  (s  +  2)  J  [(s  +  2)J  p  p  p  p 

S(s)  takes  the  form 

„  .  ,  .  f  U+D  1  f <i-2)1  r (s  ■+•  2/3)1  r<J-2)‘ 

S(s)  *  w(s)  -  - -•  j -  - 

(j  +  2)2J  L(^  +  2)J  l  (s  +  2)  J  |_U  +  2) 

(11.5) 

[w(s)(s+l)+ (J2  +  8/3s  +  4/3)  j(s-2) 

<s  +  2)3 

Now,  to  minimize  the  sensitivity  of  our  system  we  will  choose  a  stable  w(s) 
which  will  place  zeros  on  the  imaginary  axis  in  the  frequency  range  of  in¬ 
terest.  If  we  take 


M(5)  = 


( As  +  B ) 


it  will  be  stable  for  all  choices  of  A  and  B  and  yield 


S(s)  = 


(52  +  (A  +  8/3)s  +  (B  +  4/3)  1(5-2) 


which  will  allow  us  to  place  a  pair  of  zeros  anywhere  on  the  complex  plane. 
For  instance,  if  we  let  A  -  -  8/3  and  B  *  -  1/3  10.7  becomes 

[52+;]<j-2) 

*s)  =  - - - - -  (11.8) 

(s  +  2)3 

w  hich  has  zeros  at  ±j.  Of  course,  by  using  a  more  complex  w(s)  additional 
zeros  can  be  created. 

12.  Historical  notes,  generalizations,  and  conclusions 
Although  many  of  the  concepts  described  in  the  preceding  have  a  long 
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history  in  single-varia  s  control  theory,  our  purpose  has  been  to  use  the 
single-variate  case  as  a  forum  in  which  to  survey  the  algebraic  approach  to 
the  feedback  system  design  problem  developed  during  the  past  few  years 
[4,8,11,16,17,23,24].  Indeed,  a  review  of  the  preceding  eleven  sections  will 
reveal  that,  with  few  exceptions,  our  theory  has  been  formulated  with  no 
more  complex  mathematics  than  addition,  subtraction,  multiplication,  and 
division.  As  such,  one  should  not  be  surprised  to  find  that  most  of  the 
theory  can  be  extended  to  the  multivariate  case  [  1 1 ,16,24  ]  while  much  of  it 
extends  to  the  case  of  a  general  linear  system;  be  it  time-varying,  distributed 
or  multidimensional  [4,8,10].  The  purpose  of  this  concluding  section  is 
therefore  to  review  the  results  and  literature  pertinent  tc  our  algebraic  ap¬ 
proach  to  the  feedback  system  design  problem. 

Although  the  polynomial  fractional  representation  concept  is  implicit  in 
any  single-variate  system  theory  it  was  not  adapted  to  the  multivariate  case 
until  the  mid-sixties  with  the  work  of  Rosenbrock,  et  al  [2,15  ] .  Since  such  a 
polynomial  fractional  representation  does  not  admit  an  obvious  generaliza¬ 
tion  to  the  distributed  case,  several  researchers  began  to  search  for  an  alter¬ 
native  during  the  mid-seventies,  eventually  settling  on  a  fractional  represen¬ 
tation  theory,  wherein,  an  arbitrary  system  is  represented  as  the  ratio  of  two 
stable  systems  [4,8] .  This  was  first  used  in  the  distributed  system  theory  of 
Callier  and  Desocr  [4],  then  extended  to  an  operator  theoretic  framework 
[19],  and  finally  to  an  abstract  ring  theoretic  setting  [8]  in  the  late  seven¬ 
ties. 

The  starting  point  for  our  theory,  however,  rests  with  the  celebrated  1576 
papers  of  Youla,  Bongiomo,  and  Jabr  [23,24]  in  which  a  complete 
parameterization  of  the  set  of  stabilizing  compensators  for  a  general 
multivariate  system  was  first  formulated.  Of  course,  such  parameterizations 
have  long  been  known  in  the  single-variate  case  while  several  authors  gave 
alternative  formulations  [1,3,5,6,9,13,14,15,20,22,26].  Although  the  YBJ 
theory  was  originally  formulated  using  a  polynomial  fractional  representa¬ 
tion  [23,24],  it  was  soon  discovered  that  the  theory  could  be  both 
simplified  and  generalized  by  working  with  stable  factors  rather  than 
polynomial  factors.  This,  in  turn,  led  to  both  distributed  [4]  and  ring 
theoretic  [8]  formulations  of  the  stabilization  theorem.  More  recently  a 
global  formulation  of  the  YBJ  theory  has  been  given  an  algebro-geometric 
setting  [17]. 

Although  the  obvious  motivation  for  formulating  the  feedback  system 
design  problem  in  a  ring  theoretic  setting  is  to  permit  its  generalization  to 
multivariate,  time-varying,  and  distributed  systems,  two  additional  benefits 
have,  in  fact,  accrued  from  the  theory.  First,  the  ring  theoretic  setting  forces 
one  to  adopt  purely  algebraic  arguments  which  are  often  simpler  than  the 
analytic  arguments  used  in  the  polynomial  and  rational  function  ap¬ 
proaches.  Second,  rather  than  working  with  the  full  ring  of  stable  systems 
one  can  work  in  a  subring,  thereby  obtaining  a  design  theory  for  “ strongly 
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stable"  systems  [8].  In  particular,  this  allows  the  YBJ  stabilization  theory 
to  be  immediately  translated  into  a  complete  parameterization  of  the  set  of 
compensators  which  place  the  poles  of  a  feedback  system  in  a  prescribed 
subset  of  the  (strict)  left  half-plane.  Indeed,  the  set  of  transfer  functions 
with  poles  in  such  a  subset  form  a  ring  which  can  be  used  to  derive  an  alter¬ 
native  pole  placement  theory  simply  by  representing  p  (s)  as  the  ratio  of  two 
functions  with  poles  in  the  prescribed  subset  and  similarly  for  up(s),  vp(s), 
etc.  Unlike  the  formulation  of  Section  8,  however,  one  cannot  precisely 
specify  the  pole  locations  and  multiplicities  via  such  an  approach. 

In  addition  to  the  formal  stabilization  theory  the  YBJ  theory  also  has  had 
a  significant  impact  on  the  " philosophy  of  system  design  ”.  Indeed,  with  the 
advent  of  the  YBJ  theory  the  importance  of  parameterizing  the  entire 
family  of  systems  which  achieve  the  design  constraints  rather  than  simply 
specifying  a  single  design  was  made  apparent.  In  the  original  papers  of 
Youla,  Bongiorno,  and  Jabr  the  parameterization  became  the  first  step  in  a 
Wiener-Hopf  optimization  [23,24]  over  the  set  of  stabilizing  compen¬ 
sators.  As  such,  the  parameterization  led  to  an  explicit  description  of  the 
constraint  set  for  the  optimization  problem,  thereby  permitting  one  to  carry 
out  an  unconstrained  optimization  over  w(s)  rather  than  a  constrained  op¬ 
timization  over  c(s).  Of  course,  the  YBJ  philosophy  has  been  continued 
with  the  parameterization  of  the  set  of  compensators  which  solve  the 
various  tracking  and  disturbance  rejection  problems  1)1,16) .  Moreover,  it 
has  since  been  carried  over  to  other  fields  of  endeavor  by  Youla,  who  has 
parameterized  the  complete  set  of  stochastic  processes  which  are  compatible 
with  observed  data  [22];  and  by  Helton,  who  has  given  a  complete 
parameterization  of  the  set  of  impedances  which  are  compatible  with  a 
prescribed  load  [  12] .  This,  in  turn,  was  then  employed  together  with  a  non- 
Euclidian  optimization  algorithm  to  resolve  a  long  standing  broadband 
matching  problem. 

The  tracking  and  disturbance  rejection  problems  have  their  origins  in 
classical  single-vairate  control  theory  where  the  final  value  theorem  is  used 
to  formulate  the  required  criteria.  The  origin  of  the  present  formulation, 
however,  rests  with  the  work  of  Francis  [!0]  who  formulated  a  divisability 
criterion  for  the  solution  of  the  tracking  problem  and  the  work  of  Callier 
and  Desoer  [4]  who  first  integrated  the  YBJ  philosophy  with  the  tracking 
and  disturbance  rejection  problem.  The  present  formulation  is,  however, 
based  on  that  of  Saeks  and  Murray  [16]  who  derived  the  design  equations 
presented  here  in  a  general  ring  theoretic  setting  and  formulated  the 
coprimeness  criteria  for  the  existence  of  a  solution  in  a  multivariable 
setting. 

Although  robustness  has  been  a  topic  of  wide  interest  throughout  control 
theory  over  the  past  decade  the  present  formulation  dates  only  to  the  work 
of  Francis  [10]  in  1977  and  is  based  on  the  more  recent  work  of  Francis  and 
Vidyasagar  ( 11  ] .  To  our  knowledge,  however,  the  explicit  parameterization 


of  the  compensators  which  resolve  the  robust  tracking  and/or  disturbance 
rejections  problems  has  not  previously  appeared. 

Unlike  the  topics  considered  in  Sections  !  through  8  which  have  been  of 
interest  to  control  theorists  fot  a  number  of  years  the  simultaneous  design 
problem  has  only  been  recently  formulated  in  an  effort  to  develop  an  "ultra 
robust”  control  theory  [7,17,19].  Indeed,  the  solution  of  the  two  plant 
problem  presented  here  was  first  derived  by  Chua  [7]  and  has  not  pre¬ 
viously  appeared  in  the  literature  while  the  two  plant  multivariate  problem 
has  been  resolved  by  Vidyasagar  and  Viswanadham  [19].  At  the  time  of 
this  writing  the  two  plant  problem  is  the  only  simultaneous  stabilization 
problem  for  which  an  explicit  frequency  domain  solution  is  known,  indeed, 
the  techniques  used  herein  do  not  extend,  even  to  the  three  plant  case,  in  any 
obvious  manner  [7].  Interestingly,  however,  a  complete  solution  to  the 
general  simultaneous  stabilization  problem  has  been  given  in  an  abstract 
aigebro-geometric  setting  [17]  though  no  practical  method  for  implement¬ 
ing  this  theory  is  available.  In  essence,  the  simultaneous  stabilization  prob¬ 
lem  is  a  global  problem  in  which  one  must  characterize  the  properties  of  the 
entire  set  of  plants  one  desires  to  stabilize  rather  than  simply  give  a  local 
parameterization  of  the  elements  of  that  set.  As  such,  it  has  been  necessary 
to  formulate  the  problem  in  an  abstract  setting  wherein  the  easy  implemen- 
tability  of  our  theory  is  lost. 

A  solution  to  the  stable  stabilization  problem  was  first  presented  by 
Youla,  Bongiomo,  and  Lu  [25],  even  before  the  publication  of  the  YBJ 
theory,  though  it  was  essentially  unknown  until  the  relationship  between  the 
stabilization  problem  and  the  simultaneous  stabilization  problem  was  un¬ 
covered.  Unlike  the  remainder  of  our  theory  the  solution  to  the  two  plant 
simultaneous  stabilization  problem  and  the  related  solution  ot  the  stable 
stabilization  problem  are  highly  pole-zero  oriented  and,  as  such,  it  is 
unclear  whether  or  not  they  admit  a  viable  extension  beyond  the  multi- 
variable  case.  Interestingly,  however,  when  the  pole-zero  interlacing  proper¬ 
ty  of  Section  10  fails,  one  can  determine  the  minimum  number  of  unstable 
poles  required  by  any  stabilizing  compensator  simply  by  counting  the 
number  of  times  the  plant  has  an  odd  number  of  poles  between  a  pair  of  its 
positive  real  axis  zeros  [19].  Finally,  we  note  that  the  “optimization”  tech¬ 
niques  are  matched  to  our  theory.  Indeed,  the  relationship  between  the 
stabilization  theory  and  the  Wiener-Hopf  theory  was  invoked  in  the  original 
YBJ  papers  [23,24]. 


1  This  journal.  CSSP,  Voi.  1,  No.  2.  1982.  pages  137  to  169. 

*  To  be  rigorous  one  must  invoke  some  sophisticated  mathematics  to  prove  that  there  exists 
a  dystsi  »h:ch  is  coprime  with  d,  (s)  in  ever,  neighborhood  of  dp(s)  (11).  As  such,  cancella¬ 
tion  cannot  be  guaranteed  for  every  dpls'i  though  some  d^(s)  may  cancel  the  instabilities  in 
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•  Although  the  theorem  allows  for  the  possibility  that  right  half-plane  zeros  of  pis)  are 
matched  by  corresponding  zeros  in  h(s)  and  right  half-plane  poles  of  pis)  are  matched  by  cor¬ 
responding  poles  of  U  -  h\s)  1  such  matching  is  not  robust  and  thus,  for  all  practical  purposes 
the  transfer  function  design  theorem  requires  that  pis)  be  miniphase. 

4  If  one  assumes  that  o(g)  fc  rip)  -  1  as  per  the  theorem  then  the  bound  oiq)  -  rip)  is 
greater  than  or  equal  to  - 1 .  In  particular,  this  bound  is  equal  to  -  1  when  o(q)  *  »(p)  - 1  in 
which  case  8.21  admits  a  unique  solution  and  no  design  freedom  is  allowed. 
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A8STRACT 

The  problem  of  designing  a  feedback  controller 
which  stabilizes  a  number  of  plants  simultaneously 
is  discussed  from  the  fractional  representation 
point  of  view.  An  abstract  solution  of  this  gen¬ 
eral  simultaneous  stabilization  problem  is  present¬ 
ed,  and  an  elementary,  explicit  criterion  is  given 
for  the  simultaneous  stabilizability  of  two  systans. 
Finally,  some  examples  and  counter  examples  are 
presented,  and  some  open  problems  a_re  discussed. 


unstable)  stabilizing  compensator  for  a  given  plant 
is  reviewed. 

2.  SIMULTANEOUS  STABILIZATION 
AND  STABLE  STABILIZATION  ' 

We  consider  the  feedback  system  shown  in  Fig. 
1;  this  is  characterized  by  the  connection  equa¬ 
tions 


1.  INTRODUCTION 
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Classically,  in  control  theory  one  is  given  a 
plant  and  desires  to  design  a  control  system  a- 
round  this  plant  which  meets  certain  design  spec¬ 
ifications.  In  fact,  however,  a  “real  world* 
plant  is  never  known  exactly  and,  as  such,  a  rea¬ 
listic  design  must  simultaneously  meet  specifica¬ 
tions  over  an  entire  range  of  plants  which  (hope¬ 
fully)  include  the  actual  plant.  The  simplest 
form  of  the  resultant  UmJJUu teoua  du-ign  pnoblen 
is  the  *obtU£  diUgn  paobiem  wherein  one  desires 
to  meet  the  design'specifi cations  in  an  c-ball 
around  a  prescribed  nominal  plant.  Although  this 
is  satisfactory  for  dealing  with  modeling  errors  it 
cannot  cooe  with  plants  containing  unknown  para¬ 
meters  and/or  plants  characterized  by  multiple 
modes  of  operation.  For  instance,  the  dynamics  of 
an  airplane  or  rocket  vary  widely  with  altitude 
while  the  dynamics  of  an  electric  motor  change  with 
speed  and  load.  To  cope  with  these  problems  we 
muSt  formulate  a  iimuliAntom  design  zhtox-u  in 
which  one  designs  a  control  system  to  simultaneous¬ 
ly  meet  specifications  over  a  prescribed  set  of 
plants.  Of  course,  the  set  of  plants  may  be  taken 
to  be  a  ball  in  which  case  the  classical  robustness 
theory  is  replicated.  Alternatively,  one  may 
choose  to  work  with  a  set  of  plants  in  which  one  or 
more  parameters  vary  over  a  prescribed  range  and / 
or  a  discrete  set  of  plants;  say,  the  dynamics  of 
a  two  speed  motor  ir.  its  high  and  low  speed 
settings. 
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To  describe  the  dynamics  of  the  plant  and 
controller,  we  use  the  abstract  fractional  repre¬ 
sentation  theory  of  [13,  [2],  T33.  This  assumes 
four  sets 

?  3  h  3  1  P  j 

where  g  is  a  ring  with  identity  which  represents 
the  general  class  of  system  with  which  we  wish  to 
work,  h  is  a  subring  of  g  corresponding  to  the 
stable  systems  in  §,  i  is  a  multiplicative  set 
consisting  of  the  elements  in  h  wnich  have  inverses 
in  g,  and  j  is  the  subgroup  of  h  which  consists  of 
the  elements  of  h  which  have  inverses  in  h. 


or 


where 
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The  purpose  of  this  paoer  is  to  review  the 
state  of  research  on  the  simultaneous  design  prob¬ 
lem  including  the  derivation  ef  an  explicit  cri¬ 
terion  of  the  simultaneous  stabilization  of  two 
distinct  plants  and  an  algebro-geometric  solution 
of  the  general  simultaneous  stabi lization  problem. 
In  addition,  the  fundamental  relationship  between 
the  simultaneous  stabilization  problem  and  the 
problem  of  designing  a  stable  (or  minimally 


We  assume  that  the  plant  ?  has  a  richt-coprin* 
fractional  representation 

? .  n^;1 

where  the  coprimeness  is  exhibited  by 
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*n£  also  a  l#f t-coprime  representation 
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It  has  been  shown  rC2]  that  one  can  then  find  U{ 

and  V  so  that,  in  addition, 
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Thus  the  plant  P  can  be  described  by  the  matrix 
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or  its  inverse 
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The  admissible  input-output  peirs(c  ,  i»  )  for  the 
plant  ere  then  describee  by  p  P 
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where  c_  is  a  kind  of  “partial  state"  for  the 
P 

plant  P. 

The  corresponding  matrices  for  the  controller 
will  be  denoted  by  Rc  and  Sc,  etc.. 

It  can  then  be  shown  [31.  T*]  that  a  given  con¬ 
troller  C  will  stabilize  a  plant  P  if  and  only  if 
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where  0  is  the  connection  matrix  (2.1),  and  V  is  of 
the  fora 


w*-e  each  matrix  is  in  GLj(h)  -  i.e.,  is  a  ZxZ 
matrix,  c*  elements  cf  h  which  has  an  inverse  whose 
a! events  ere  in  h.  It  is  easy  to  see  that  berno  of 
'err,  (2.2)  is  ecui valent  to  being  an  element 
•'  ul : ;  n }  wnese  (1 ,1  '-element  is  in  j,  end  that 
nis  ir.  turn  is  equivalent  to  being  the  R-matrix 
f  a  stable  system.  Thus  in  terms  of  the  R-matrix 
esresentation,  we  can  restate  (2.2)  as 


Theorem  1 :  [4] 

The  set  of  plants  stabilized!*  by  a  compen¬ 
sator  C  is  precisely  the  set  of  stable  plants 
transformed  by  left  multiplication  by  QR^1 

This  can  also  be  resta~.ee  as  follows:  A  set 
P  of  plants  is  simultaneously  stapilizable  iff 
P  lies  in  an  image  of  tne  stables  under  left  mul¬ 
tiplication  by  some  element  of  £Lh(2). 

These  criteria,  while  geometrically  appealing, 
can  not  be  handled  analytically:  the  following 
epui valent  criterion  is  therefore  useful. 

Theorem  2:  ft] 

Let  P  "be  a  set  of  plants  represented  by 
Rp,  p  t  P.  The  the  set  P  is  simultaneously  stab¬ 
ilized  e  iff  there  exists  a  family  of  matrices 
Wp,  p  c  P,  of  the  form  (2.3)  such  thet 


.vmcp. 


Proof: 


The  necessity  of  this  condition  is  obvious 
from  (2.2).  The  sufficiency  can  be  checked  by  de¬ 
fining 

Rc  *  Ofyp1  0 

fer  any  p  c  P,  and  using  the  condition  to  check 
that  Rc  is  well  defined.  Then  (2.2)  follows. 

One  can  Insert  the  requirement  that  the  com¬ 
pensator  be  stable  merely  by  adjoining  the  zero 
plant  to  the  given  set  of  plants  -  a  compensator 
is  stable  if  and  only  if  it  stabilizes  the  zero 
plant.  Thus  the  stable  stabilization  of  n  plants 
simultaneously  cart  be  treated  es  a  problem  of  sim¬ 
ultaneously  stabilizing  n+1  plants.  The  converse, 
although  less  obvious,  is  rlso  true  -  see  C3j. 

3.  THE  TWO-PLANT  CASE 

The  only  ease  of  which  we  know. in  which  even 
the  analytic  criterion  in  section  2  car.  be  imple¬ 
mented  is  the  case  of  two  plants.  In  this  cese, 
th*  criterion  is  as  follows:  two  plants,  with  rep¬ 
resenting  matrices  R^  end  R»,  can  be  simultaneously 

stabilized  iff  there  exist  matrices  and  of 
the  form  (2.3)  such  that 

<’"2  "  *]\  (2.1) 

We  note  that,  since  the  identity  R-matrix 
corresponds  to  the  zero  plant,  tms  is  equivalent 

to  the  condition  that  the  "plant"  Rl'R.  neve  a 

stable  stabilizing  compensator.  (This  is  an  ex¬ 
ample  of  the  converse  mentioned  st  the  end  of 
section  2.)  For  the  linear  time-invariant  case, 
the  problem  has  been  solved  by  Youla  [5).  It  is 
of  interest,  however,  to  relate  voula's  solution 


to  our  approach.  To  this  end,  we  denote  R^  R2  by  EXAMPLES:  [<) 


u 

* 

_N 


and  restrict  ourselves  to  the  scalar-input  scalar- 
output  case.  If  we  write  (3.1)  in  the  form 

W^R  •  W2 

and  use  the  fact  that  the  (1,1) -entry  in  each,  of 
the  W-matrices  must  be  in  j,  we  get  the  condition: 


D  +  TN  e  j  for  some  t  e  h. 

k 

In  the  present  case,  this  is  clearly  equivalent  to 
requiring  the  existence  of  a  stable,  minimum-phase 
transfer  function  which,  at  each  closed  right  half¬ 
plane  zero  of  N,  interpolates  0  to  an  order  equal 
at  least  to  the  order  of  the  2ero  of  N.  Continuity 
and  realness  of  the  transfer  functions  show  that  a 
necessary  condition  for  this  is  that  D  have  the 
same  sign  at  all  closed  positive  reel-axis  zeros  of 
N.  Youle  showed  that  this  was  also  sufficient. 

Thus  to  solve  the  problem  of  simultaneously 
stabilizing  two  plants,  it  remains  only  to  express 

K  and  D  in  terms  of  the  original  plants.  An  easy 
calculation  shows  that  we  can  take 

N  «  K201  - 

and  D  «  V.,D2  *  U.,K2  . 


Thus  in  order  for 
eouslv  stabilizable,  it 
ier.t  that  *  U-jN-j 

closed  positive  reel-ax 

Some  calculations  invol 
ditions  show  that  this 
dition  thet  D^/D,  have 
closed  positive  real -ex 

This  gives  a  criterion 
functions  themselves  3 


the  plants  to  be  simultan- 
is  necessary  and  suffic- 
have  the  same  sign  at  all 

is  2ercs  of  N‘2D1  - 

ving  the  coprimeness  con- 
is  equivalent  to  the  con- 
the  same  sign  at  ell 
is  zeros  of  -  n^Dj. 

ir  terms  of  the  trensfer- 
]F4]. 


4.  EXAMPLES  Aio  PROBLEMS 

The  or.'y  case  in  which  the  geometric  results 
can  be  illustrated  on  paper  is  the  case  in  wnich 
there  ere  only  two  parameters.  For  this  reason  our 
first  example  will  peal  with  this  situation. 


Suppose  given  a  family  of  plants  of  the  f0m 


we  would  like  to  know  when  it  is  possible  to  stab¬ 
ilize  these  simultaneously  by  use  of  a  proportional 
compensator  with  gain  t.  In  this  case  the  denomi¬ 
nator  of  the  closed-loop  transfer  function  is 

d(s)  »  .5  *  (B+tA) 

and  so  the  feedback  system  will  t  stable  iff 
B+Ta  >  0.  Thus  a  set  of  plants  is  simultaneously  -  - 
stabilizable  iff  for  some  t,  B+tA  >  0  for  each 
plant  in  the  set.  Now  each  plant  can  be  represent¬ 
ed  as-  a  point  in  the  (A.B)-plane;  in  this  represen¬ 
tation,  a  set  of  plants  is  simultaneously  stebi- 
lizable  iff  the  set  lies  entirely  above  some 
straight  line  through  the  origin.  The  slope  of 
such  a  line  is  -t,  where  t  is  the  gain  of  a  stab¬ 
ilizing  compensator.  For  example,  the  set  in 
Figure  2.b  is  stabilizable,  while  the  sets  in 
Figures  2.c  and  2.d  are  not.  Since  the  set  of 
stables  in  this  case  is  the  upper  half-plane,  this 
gives  a  very  vivid  (although  very  special)  illus¬ 
tration  of  theorem  1:  a  set  of  dents  is  ximultjin. 
eously  stabilizable  iff  it  is  contained  in  a  ro¬ 
tated  (or  more  accurately,  sheared)  version  of  the 
set  of  stables. 

Our  second  example'  is  a  -counterexample  to 
the  effect  that  even  if  every  pair  of  plants  in  a 
set  is  simultaneously  stabilizable,  the  entire  set 
may  not  be.  To  this  end,  we  take  a  set  consisting 
of  three  plants  (pp.p^pj);  here  pQ  is  the  zero 

plant,  and 

P-,  -  nl/d1  .  p2  -  n2 /d2. 

where  ,  d^,  and  d2  are  graphed  in  Figure  3. 

It  is  easy  tp  see,  by  using  Youla's  criterion,  that 
p^  end  p2  each  have  stable  stabilizing  compensators, 

and  by  using  the  criterion  in  section  3,  that  p-j 

end  p2  have  a  simultaneous  stabilizing  compensator. 

However,  there  is  no  stable  compensator  which  sim¬ 
ultaneously  stabilizes  p,  and  "...  If  there  were, 
then  by  the  criterion  of  Theorem  2  there  would  be 
a  stable  transfer  function  f  such  that  both  d^  + 

fn^  «  r.(  and  d2  *  fn2  *  r2  were  stable  and  minimum- 

phase.  At  the  zeros  cf  n^,  d.  >  0  and  so  r1  must 

be  positive  on  the  positive  real  axis.  Similarly 
r2  must  be  positive  on  the  positive  real  axis. 

However,  if  we  eliminate  f  we  see  that 

n2di  -  -  n^ 

and  so,  at  the  zeros  of  r.2b,  -  n.dj,  we  must  have 
*  nir2-  kut  at  these  points,  >  0  end 
n2  <  0,  and  so  we  get  a  contradiction.  Thus  Pq.P^ 
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and  Pj  can  not  be  simultaneously  stabili2able. 

It  should  be  clear  from  the  above  that  there 
are  more  problems  than  solutions  in  this  area  of 
research.  At  present,  for  example,  we  do  not  know 
any  testable  necessary  and  sufficient  conditions 
for  the  simultaneous  staoil liability  of  even  three 
scalar  plants.  Thus  as  a  first  problem,  we  might 
state: 

Problem  1:  Find  necessary  and  sufficient  condition 
'for  the  simultaneous  stabilizability  of  N(>2) 
plants. 

The  following  problem  may  also  be  of  interest: 

Problem  2:  If  one  knows  that  a  set  of  plants  can 
be  simultaneously  stabilized,  how  does  one  find  all  ■ 
compensators  (or  even  some  compensators)  which  will 
stabilize  the  set? 

Problem  3:  Since  stabilization  alone  is  usually 
not  enough,  can  one  fine  conditions  for  the  exis¬ 
tence  of  a  compensator  which,  in  addition  to  simul¬ 
taneously  stabilizing  a  set  of  plants,  will  also 
cause  them  to  satisfy  some  other  conditions  (e.g. 
track  a  specified  input  signal)? 
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5.  CONCLUSIONS 

We  have  discussed  the  problem  of  finding  a 
compensator  which  stabilizes  every  plant  in  a  given 
set  of  plants.  In  the  abstract,  both  geometric 
and  analytic  criteria  have  been  given  for  the  ex¬ 
istence  of  such  a  compensator.  However,  only  in 
very  special  cases,  such  as  the  case  of  two  plants, 
can  these  criteria  be  checked.  Another  special 
case,  in  which  the  geometric  criterion  becomes 
particularly  clear,  is  the  case  of  first-order 
plants  with  proportional  controllers.  We  have  also 
given  an  example  where  pairwise  simultaneous  stab¬ 
ilizability  of  a  set  of  plants  does  not  imply  over¬ 
all  simultaneous  stabilizability.  Finally,  we  have 
indicated  some  directions  for  further  research. 
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Abstract  —  An  explicit  relationship  between  the  fractional  representation 
approach  feedback  system  design  and  the  aifebro- geometric  approach  to 
system  theory  is  formulated  and  used  to  derive  a  global  solution  to  the 
feedback  system  problem.  These  techniques  are  then  applied  to  the  simul¬ 
taneous  stabilization  problem,  yielding  a  natural  geometric  criterion  for  a 
set  of  plants  to  be  simultaneously  stabilized  by  a  single  compensator. 


I.  Introduction 

CLASSICALLY,  in  control  theory  one  is  given  a  plant 
and  desires  to  design  a  control  system  around  this 
plant  which  meets  certain  design  specifications.  In  fact, 
however,  a  “real  world”  plant  is  never  known  exactly  and, 
as  such,  a  realistic  design  must  simultaneously  meet  specifi¬ 
cations  over  an  entire  range  of  plants  which  (hopefully) 
include  the  actual  plant.  The  simplest  form  of  the  resultant 
v simultaneous  design  problem  is  the  robust  design  problem 
wherein  one  desires  to  meet  the  design  specifications  in  an 
<  ball  around  a  prescribed  nominal  plant.  Although  this  is 
satisfactory  for  dealing  with  modeling  errors,  it  cannot 
cope  with  plants  containing  unknown  parameters  and/or 
plants  characterized  by  multiple  modes  of  operation.  For 
instance,  the  dynamics  of  an  airplane  or  rocket  vary  widely 
with  altitude,  while  the  dynamics  of  an  electric  motor 
change  with  speed  and  load.  To  cope  with  these  problems, 
we  must  formulate  a  simultaneous  design  theory  in  which 
one  designs  a  control  system  to  simultaneously  meet  speci¬ 
fications  over  a  prescribed  set  of  plants.  Of  course,  the  set 
of  plants  may  be  taken  to  be  a  ball,  in  which  case  the 
classical  robustness  theory  is  replicated.  Alternatively,  one 
may  choose  to  work  with  a  set  of  plants  in  which  one  or 
more  parameters  vary  over  a  prescribed  range  and/or  a 
discrete  set  of  plants,  say,  the  dynamics  of  a  two-speed 
motor  in  its  high-  and  low-speed  settings. 

The  simultaneous  design  concept  is  possibly  best  il¬ 
lustrated  in  the  first-order  case  wherein  a  simple  geometric 
solution  suggests  itself.  Assume  that  our  plants  are  of  the 
form 

><*>  =  775  (u> 
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and  we  desire  to  design  a  stable  feedback  system  using  a 
proportional  compensator  with  gain  /.  This  results  in  a 
system  with  characteristic  function 

d(s)  =  s  +  (B  +  tA)  (1.2) 

and,  as  such,  the  feedback  system  will  be  stable  if  and  only 
if  2?  +  l4>0.  Here,  for  a  given  compensator  t,  the  feed¬ 
back  system  will  be  stable  if  and  only  if  the  point  (A.  B) 
lies  above  the  line  with  slope  1/r  as  shown  in  Fig.  1(a).  As 
such,  if  we  want  to  simultaneously  stabilize  an  entire  set  of 
plants,  their  representations  on  the  A-B  plane  must  all  lie 
above  a  line  through  the  origin.  For  instance,  the  set  of 
plants  indicated  by  the  hatched  region  in  Fig.  1(b)  can  be 
simultaneously  stabilized  (by  a  compensator  with  gain 
—  4),  while  the  set  of  plants  shown  in  Fig.  1(c)  cannot  be 
simultaneously  stabilized  since  they  subtend  an  angle 
greater  than  180°  on  the  A-B  plane.  Similarly,  the  set  of 
plants  shown  in  Fig.  1(d)  cannot  be  simultaneously  stabi¬ 
lized  since  they  cross  the  negative  A  axis. 

The  above  example  suggest  two  alternative  criteria  for 
the  simultaneous  stabilization  problem.  One  may  adopt  an 
algebraic  criterion  ti  the  effect  that 

B  +  tA>0  (1.3) 

for  each  plant  in  the  prescribed  set  and  some  r.  While  such 
a  test  is  definitive,  it  is  local  in  nature,  allowing  one  to  test 
for  stabilizability  on  a  plant-by-plant  basis,  but  yielding  no 
global  criterion  with  which  to  characterize  a  set  of  plants 
which  is  simultaneously  stabilizable.  To  the  contrary,  one 
may  adopt  a  global  geometric  viewpoint  to  the  effect  that  a 
prescribed  set  of  plants  is  simultaneously  stabilizable  if  and 
only  if  it  is  contained  in  an  appropriate  half-plane.  The 
goal  of  the  present  paper  is  the  formulation  of  a  similar 
geometric  criterion  for  the  simultaneous  stabilization  prob¬ 
lem  applicable  to  general  linear  systems. 

The  starting  point  for  our  theory  is  the  ring- theoretic 
fractional  representation  theory  introduced  by  the  authors 
in  a  series  of  recent  papers  [9],  [16]  in  which  the  set  of 
compensators  for  a  given  plant  are  parameterized.  More¬ 
over.  as  a  first  cut  at  the  simultaneous  stabilization  prob¬ 
lem.  one  can  reverse  the  role  of  the  plant  and  compensator 
in  this  theory  to  parameterize  the  set  of  plants  which  are 
stabilized  by  a  given  compensator.  In  practice,  however, 
one  is  not  given  a  compensator  a  priori  and.  as  such,  we 
must  characterize  the  set  of  plants  obtained  by  the  latter 
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Fig.  I. 


parameterization  independently  of  the  choice  of  compensa¬ 
tor.  For  instance,  in  our  first-order  example,  the  set  of 
plants  in  Fig.  1(a)  are  stabilized  by  a  given  compensator 
-with  slope  1  /u  while  a  given  set  of  plants  is  simultaneously 
stabilizable  if  and  only  if  it  lies  in  the  half-plane  above 
some  line  through  the  origin.  For  the  general  problem,  this 
is  achieved  by  translating  the  fractional  representation 
theory  into  an  appropriate  geometric  setting  in  which  the 
“shape”  of  the  set  of  plants  obtained  from  the  latter 
parameterization  may  be  characterized.  In  turn,  the  simul¬ 
taneous  stabilization  problem  may  be  resolved  by  requiring 
that  the  given  set  of  plants  lie  in  a  region  of  the  ap¬ 
propriate  “shape.” 

Indeed,  the  appropriate  geometric  setting  proves  to  be 
just  the  Grassmannian  first  introduced  into  the  system 
theory  literature  by  Hermann  and  Martin  [11],  [12].  Unlike 
'  their  frequency  domain  formulation,  however,  we  obtain 
the  Grassmannian  directly  from  the  ring-theoretic  frac¬ 
tional  representation  previously  employed  by  the  authors. 
Indeed,  the  Grassmannian  is  obtained  simply  by  factoring 
out  the  nonuniqueness  inherent  in  the  fractional  represen¬ 
tation  theory.  As  such,  in  addition  to  formulating  the 
global  theory  necessary  for  our  study  of  the  simultaneous 
stabilization  problem,  the  geometric  approach  yields  new 
insight  into  the  relationship  between  the  fractional  repre¬ 
sentation  theory  (which  we  identify  with  the  elements  of  a 
general  linear  group)  and  the  system  itself  (which  we 
identify  with  the  elements  of  a  Grassmannian). 

In  the  following  section,  the  fractional  representation 
tneory  is  reviewed  and  the  required  Grassmannian  is  con¬ 
structed.  The  resultant  theory  is  then  used  to  formulate  a 
global  description  of  the  set  of  stabilizing  compensators  for 
a  given  plant  in  Section  III.  The  resultant  formulation  also 
yields  new  insight  into  the  problem  of  stabilizing  a  plant 
with  a  stable  compensator  [25]  for  which  a  necessary  and 
sufficient  condition  is  also  derived  in  Section  III.  Finally, 
the  simultaneous  stabilization  problem  is  investigated  in 
Section  IV  wherein  both  global  geometric  and  local  alge¬ 
braic  criteria  for  the  simultaneous  stabilization  of  a  pre¬ 
scribed  family  of  plants  are  obtained. 

Although  the  present  paper  is  formulated  in  terms  of  the 
abstract  (pseudo)  copnme  fractional  representation  theory 
of  (5j.  [9],  [10],  [  16]— [  1 9],  it  should  be  pointed  out  that  this 


is  simply  one  manifestation  of  a  family  of  related  ap¬ 
proaches  to  the  control  system  design  problem  developed 
during  the  past  half-dozen  years  by  Perr.ebo  and  Astrom 
[13],  [14];  Amsaklis.  Pearson,  and  Cheng  [i],  [6],  [7];  Youla. 
Bongiorno,  Jabr,  and  Lu  [22]— [25]:  and  Zames  [26]  among 
others  [2],  [20].  [2 1  ].  Indeed,  the  approaches  of  these  authors 
are  all  closely  related,  any  one  of  which  could  have  been 
used  as  the  basis  for  the  present  investtgauons.  In  particu¬ 
lar,  the  formulation  of  Zames  is  applicable  in  a  general 
ring-theoretic  setting,  and  is  essentially  equivalent  to  that 
employed  herein. 

II.  Fractional  Representation  and  the 
Grassmannian 

The  algebraic  fractional  representation  theory  is  set  in  a 
nest  of  rings,  groups,  and  multiplicative  structures: 

gDhDiDj. 

Here,  g  is  a  ring  with  identity  which  represents  the  general 
class  of  systems  with  which  we  wish  to  work:  rational 
matrices,  continuous  operators,  a  class  of  transcendental 
functions,  etc.;  and  h  is  a  subring  of  g  containing  the 
identity  which  models  the  systems  which  are  stable  in  some 
sense:  poles- in  a  prescribed  region,  transcendental  func¬ 
tions  with  restricted  singularities,  causal  operators,  etc. 
Finally,  i  denotes  the  multiplicative  set  composed  of  ele¬ 
ments  of  A  which  admit  an  inverse  in  g,  while  j  denotes  the 
multiplicative  subgroup  of  i  made  up  of  elements  which  are 
invertible  in  A.  Detailed  examples  of  this  structure  were 
given  in  14]  and  [8]  and  will  not  be  repeated  here. 

We  say  that  a  system  sGg  has  a  right  fractional  represen¬ 
tation  in  {g.  A,  i,  j)  if 

»-«/;  (2-0 

where  n„Eh  and  dtrE  i.  Furthermore,  we  say  that  this 
representation  is  right  copnme  if  there  exist  u,r  and  vtr  in  A 
such  that 

^  ^  '  (2.2) 

This  equality  is  equivalent  to  the  classical  coprimeness 
concept  for  rational  functions  and  matrices,  while  being 
defined  in  our  general  ring-theoretic  setting.  In  particular, 
if  g  is  the  nng  of  rational  functions  and  A  is  the  nng  of 
polynomials,  (2.2)  implies  that  nir  and  dsr  have  no  common 
zeros:  and  if  g  is  the  nng  of  rauonal  functions  and  A  is  the 
ring  of  exponentially  stable  rational  functions.  (2.2)  implies 
that  ntr  and  dlr  have  no  common  right  half-plane  zeros. 

Since  g  is.  in  general,  noncommuiative.  we  also  define  a 
left  fractional  representation  for  s  via  the  equality 

(2.3) 

for  ntl£h  and  d,,Ei.  Furthermore,  we  say  that  this  repre¬ 
sentation  is  left  coprime  if  there  exjst  usl  and  t*„  in  h  such 
that 

n,tu„  =  1-.4) 
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Of  course,  in  the  classical  case  of  a  rational  function  or 
matrix,  these  fractional  representations  are  assured  to  exist 
[10].  However,  this  is  not  the  case  in  the  general  ring-theo¬ 
retic  setting.  Therefore,  for  distributed,  time-varying,  and 
multidimensional  systems,  we  assume  that  our  plants  admit 
such  a  representation  as  a  prerequisite  to  the  theory.1 
Interestingly,  however,  if  such  a  representation  exists,  we 
may,  without  loss  of  generality,  choose  usl  and  vsl  such  that 
the  equality 

=  *»,A#  (2-5) 

also  holds  [8].  In  this  case,  we  say  that  the  representation 
for  s  defined  by  the  eighi-tuple  {nsr,dsr,usr,vsr,nsl, 
-  dsl,  us/,  uw)  is  doubly  coprime  and  we  express  the  defining 
equalities  of  (2.1)— (2.5)  via  the  matrix  equality  R~'  =  S, 
where 


where  o,  =  [d~' ]r,  is  an  appropriate  “partial  state"  vari¬ 
able.  As  such,  the  2X2  matrix  R,  defines  a  natural  model 
for  the  given  system.  Indeed,  when  such  a  model  is  em¬ 
ployed,  one  can  drop  the  invertibility  requirement  on  dtr 
(and  dsl),  although  the  matrix  Rs  must  still  admit  an 
inverse  with  entries  in  h } 

Since  the  2x2  matrices  R ,  and  S,  have  entries  in  h  and 
admit  an  inverse  which  also  has  entries  in  A,  they  form  a 
group  which  we  denote  by  GLh{ 2),  i.e.,  the  general  linear 
group  of  2  X  2  matrices  with  entries  in  A.  If  the  elements  of 
GLk(2)  are,  however,  to  serve  as  a  viable  system  represen¬ 
tation,  we  must  be  cognizant  of  the  fact  that  several  such 
matrices  may  represent  the  same  plant.  The  appropriate 
equivalence  classes  may,  however,  be  characterized  with 
the  aid  of  the  subgroup  E  C  GLk(2)  composed  of  the  upper 
triangular  matrices 


and 


It  is  interesting  to  compare  the  above  formulation  with 
that  proposed  by  Zanies  [26].  Rather  than  working  with  an 
unstable  system,  s,  Zames  assumes  that  the  given  plant  is 
first  stabilized  via  classical  techniques,  and  then  develops 
his  design  theory  around  the  resultant  stable  plant  s.  Now, 
since  s  is  stable,  it  admits  the  trivial  right  coprime  represen¬ 
tation  I  =  with  the  equality 

[0][s]  +  [l][r]  =  usrnsr  +  v,rd„  =  1  (2.8) 


implying  right  coprimeness,  while  a  similar  equality  defines 
a  left  coprime  representation  for  s.  As  such,  the  matrices 
Rr  and  ST  take  on  a  very  special  form,  permitting  Zames  to 
implement  his  design  theory  without  explicitly  dealing  with 
u's  and  u’s  nor  even  introducing  the  coprimeness  concept. 

The  key  to  our  geometric  formulation  of  the  fractional 
representation  theory  lies  with  the  observation  that  the 
2x2  matrices  R ,  and  S,  constitute  a  natural  and  concise 
representation  for  the  given  plant  which  can  serve  in  lieu  of 
the  eight-tuple  of  n’s.  d’s,  u’s.  and  u’s.  Indeed,  if  the 
input-output  relation  for  our  system  is  characterized  by 
the  equality 

(2.9) 

.  then  the  admissible  input-output  pairs  [9]  («,,«',)  for  our 
plant  are  parameterized  by  the  equality 


(2.10) 


in  GLk{ 2).  Note  that  since  these  triangular  matrices  are 
assumed  to  be  in  GL„( 2),  it  follow's  that  eu  and  are  in  j. 

Property  1:  Let  Rs  and  be  in  GLk(2).  Then  Rt  and  Rs 
represent  the  same  system  if  and  only  if  there  exists  EG  E 
such  that  R ,  =  RSE. 

Proof:  Tf  R,  —  R,E  for  some  EG  E.  then 


(2.12) 


where  o,  =  eltos.  As  such,  the  set  of  input-output  pairs 
defined  by  R ,  coincides  with  those  defined  by  Rs  except 
for  a  change  of  parameterization. 

If  R,  and  R,  define  the  same  set  of  input-output  pairs, 
then  for  any  such  pair  (tt,  rt),  there  exist  as  and  d,  such 
that 


which,  in  turn,  implies  that 


Now,  £  =  R~'RSG  GLk( 2)  since  Rs  and  R.  are  in  GLk( 2), 
while  R  =  RSE  by  construction.  As  such,  it  suffices  to  show 
that  EG  E.  This,  however,  follows  from  the  fact  that  (2.14) 
holds  for  all  a,  (and  a  corresponding  6S).  ■ 

Given  that  any  two  representations  in  GLk(2)  for  the 
same  system  differ  by  a  left  factor  in  E,  a  natural  setting 
for  our  system  theory  is  the  quotient  space  GLk(2)/E. 
Although  GLk( 2)  is  a  group  and  £  is  a  subgroup.  £  is  not 
normal  and.  as  such.  GLk(2)/E  is  not  a  group.  For¬ 
tunately,  however,  the  resultant  coset  space  (of  equivalence 
classes)  is  a  well-known  and  much  studied  geometric  ob- 


Computationalls.  (he  evaluation  of  u  and  i,r  (or  u,,  and  i„i  rtductb 
to  [he  '->!uwon  of  a  linear  equation  in  tne  nng  h  In  particular,  if  A  is  a 
ring  of  stable  rational  functions  or  matrices  thereof,  one  can  multiple  (2  2) 
and  (2  4)  h>  a  common  denominator,  and  thcrehs  reduce  their  solution  to 
that  of  a  classical  poK  norma!  equation 


•  In  the  case  where  d~,'  does  not  exist,  the  plant  defines  a  relation  rather 
than  a  (unction  on  the  input-output  space  The  resultant  relation  is. 
however,  parameterized  b>  o,  Moreover,  since  /?,  is  invertible,  the 
relation  is  normal  in  the  same  sense  of  115] 
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ject.  the  Grassmannian  [  1  ]  G„(  1 . 2).  which  we  will  adopt  as 
the  basic  setting  Tot  our  system  theory  .3 

Since  £  is  not  a  normal  subgroup  of  GLk{2).  G*(1.2)  is 
not  a  group  and.  as  such,  does  not  admit  an  ‘  internal” 
algebraic  structure.  The  group  GLk( 2)  does,  however,  act  as 
a  set  of  transformations  on  <7*(  1 . 2).  Indeed,  if  TG  GLk(l) 
and  [£,’]€  G*(l. 2}  is  the  equivalence  class  of  UGGLk( 2). 
we  define 
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(2.18) 


showing  that  tn=euE  j  [since  £  is  invertible  in  GLk{ 2)J. 
Conversely,  if  tuG  j,  we  may  factor  T  via 


r[c/]*[rt/]e<7*(i,2).  (2.15) 

Now,  if  [U\  —  [V],  then  Property  1  implies  that  there  exists 
EGE  such  that  V = i/£;  hence. 

T[V]  =  T[UE)  =  [H/E]  =  [TT/]  (2.16) 

via  Property  1.  As  such,  the  operation  of  GLk(2)  on 
G*(1.2)  is  well  defined. 

-  As  a  prerequisite  to  the  formulation  of  our  stabilization 
-4116017,  it  is  necessary  to  characterize  the  stable  systems  as 
interpreted  in  GLk(2)  and  G*(l,2).  Recall  that  if  s  5  stable 
(sGh).  then  s  admits  the  doubly  coprune  fractional  repre¬ 
sentation 

*■•[;  ?]• 

Denoting  the  set  of  such  2X2  matrices  by  W  CGLk( 2),  it 
then  follows  from  Property  1  that  the  set  of  all  representa¬ 
tions  for  the  stable  systems  in  GL„{2)  take  the  form 
S  =  WE  and,  as  such,  they  are  represented  by  [S] -{WE} 
=  [W]c  G*(l,2).  Although  W  and  £  are  both  subgroups 
of  GLk(  2),  they  are  not  commutative  and,  as  such,  S  —  WE 
is  not  a  group.  5  is,  however,  characterized  by  the  follow¬ 
ing  property. 

Property  2:  Let 

r=f'"  ',2 

,f2l  '22. 

be  in  GL„( 2).  Then  TG S  if  and  only  if  tuGj. 

Proof:  Since  S  —  WE  if  TES,  then  T  —  WE  for  some 
WG  W  and  EG  E;  hence. 

3U  *  is  the  field  of  scalars.  G»(1.2)  is  the  classical  Crassmaasaa  (of 
lines  in  2-space),  whereas  if  *  is  taken  to  be  n  x  n  matrices.  G,(l.2) 
reduces  to  the  classical  Grassmannian  of  n  planes  in  2  it  space  (1). 
Although  these  classical  Grass manmans  have  an  analytic  structure  (com¬ 
pact  manifold)  which  may  not  be  shared  bv  our  abstract  Grassmannian. 
the  algebraic  properties  of  G,(  1.2)  are  ail  tiiat  is  required  for  the  present 
theory,  and  hence  no  difficulty  is  encountered  by  working  with  elements 
taken  from  an  abstract  ring.  Indeed,  for  our  purposes,  we  only  use  the  fact 
that  G*(  1.2)  is  the  coset  space  GLk{ 2)/£  and  identify  it  as  the  Grassman- 
pian  only  to  make  the  connection  with  the  classical  literature. 

It  is  interesting  to  note  that  the  Grassmannian  has  been  used  as  a 
natural  setting  for  multivariate  systems  by  mathematical  system  theorists 
for  a  number  of  yean  128],  [29]."  [32]  Here,  a  system  represented  by  an 
n  x  n  frequency  response  mains  :s  identified  with  a  curve  taking  values  in 
the  classical  Grassmannian  of  n  planes  in  In  space.  As  such,  inai  theory 
identifies  a  system  with  a  Grassmannian-valued  function,  while  our 
formulauon  identifies  the  system  with  a  Grassmannian  built  from  a  ring 
of  functions  Of  course,  the  two  approaches  are  completely  equivalent  in 
the  muluvar.ate  case,  while  the  present  formulation  is  also  well  defined 
for  general  linear  systems  (time-varying,  distnbuted.  etc.) 


'it 

'-1- 

0 

'11 

'l2 

.'21 

'aj 

,'21'n 

l 

0 

'22  “,2i'i:,'i2. 

(2.19) 

Since  /, , £  j.  w  =  r2Ir~, 1  and  fa  ~  are  in  it.. Thus, 

the  uni:  triangular  nature  of  W  implies  that  it  is  in  GLk( 2). 
This,  in  turn,  however,  implies  that 

E  =  Hr'TGGLk(  2)  (2.20) 

since  GLk(2)  is  a  group.  Since  £  is  upper  triangular  and 
£  £  GLk(  2>,  EGE,  a  was  to  be  shown.  ■ 

Since  W  and  £  are  both  groups,  the  equality  5  =  WE 
implies  that  S~'  —  EW.  Now,  a  little  algebra  similar  to 
that  used  in  the  proof  of  Property  2  will  reveal  that 
TG  S~'  if  and  only  if  t^G  j.  Indeed.  5  and  S-1  are 
precisely  the -two  classes  of  matrices  for  which  the  2X2 
matrix  inversion  formula  is  applicable  [16]. 

Before  concluding  this  section,  it  is  instructive  to  com¬ 
ment  on  the  relationship  between  GLk( 2)  and  G»(1.2).  In 
essence,  GLk{2)  is  a  set  of  representations  for  our  systems, 
while  Gt(l,2)  represents  the  set  of  systems.  That  is  to  say, 
GLk( 2)  is  composed  of  the  computationally  tractable  ob¬ 
jects  with  which  we  actually  describe  a  system,  although 
many  such  objects  may  represent  the  same  system.  On  the 
contrary,  each  element  of  G*(l,2)  is  uniquely  identified 
with  a  system,  and  hence  we  may  think  of  Gk(\,2)  as 
“being  the  set  of  systems"  (or,  at  least,  being  in  one-to-one 
correspondence  with  the  set  of  systems).  On  the  other 
hand,  the  elements  of  G*(1.2)  are  not  computationally 
tractable,  except  through  the  intermediary  of  GLk{2).  In 
practice,  therefore,  a  plant  is  characterized  by  one  of  its 
representations  in  GL„{ 2).  while  the  goal  of  the  system 
design  problem  is  to  specify  an  appropriate  compensator  in 
Gk(  1 , 2).  That  is.  we  are  designing  a  compensator  rather 
than  a  representation  of  a  compensator,  and  hence  even 
though  we  work  in  GLk( 2)  as  a  matter  of  computational 
necessity,  the  result  of  the  design  process  is  an  element  of 

G*(U). 

III.  Stabilization 

The  basic  feedback  system  we  consider  is  shown  in  Fig. 
2.  The  system  is  characterized  by  the  connection  equations 

'  »c  (3  1) 

and 
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.1"' 


<0  =  /*!- V  ( 

while  the  plant  and  compensator  are  characterized  by 


=^o 


vcf]=/Jco]>  (3-4) 

respectively,  where  Rp  and  R  c  are  in  GLh(2).  Letting  Q  be 
the  2  X  2  matrix  in  GL„(2)  defined  by 

«=[-?  J]-  <35> 

the  connection  equations  (3.1)  and  (3.2)  can  be  expressed 


A  little  algebra  will  also  reveal  that  Q~ 1  =  Q1  =  -  Q;  hence, 
this  connection  matrix  is  readily  manipulated.  Finally,  the 
substitution  of  (3.3)  and  (3.4)  into  (3.6)  yields  the  equality 

=0  “2^0  =Rpp°oPc  -QR<Qpi\ 

=  [R,Px  +  Q'RcQP$  ]■  (3-7) 


r[s]  =  {[rs]:ses}cc*(1.2).  (3.9) 

Finally,  let  Rc(Rp)  denote  the  set  of  stabilizing  compensa¬ 
tors  in  GA(1,2)  for  a  given  plant  represented  bv  RpE 
GL„(  2). 

Theorem:  R r(Rp)  =  QRpQ’\S}. 

Proof:  To  prove  the  theorem,  we  will  show  that  the  set 
of  all  representations  for  the  stabilizing  compensators  in 
GLk(2)  takes  the  form  QRpQ'S  for  some  SE  S,  from  which 
the  theorem  follows  upon  mapping  these  presentations  into 
corresponding  systems  which  are  identified  as  elements  of 
Gfc(l,2).  If  Rc  =  QRpQ'S  for  some  SeS,  then 

+  Q‘RcQP2]  =  [RfPx  -t  QiQRpQ'S)QP,] 

=  Rp[Pt  +  Q‘SQP2).  (3.10) 


Now,  if 


(3.11) 


with  suEj,  since  SES,  a  little  algebra  will  reveal  that 


Here  P ,  =  diag(l,0]  and  P2  =  diag[0, 1  j. 

On  the  basis  of  the  above  formulation,  we  say  that  the 
system  is  stable  if  and  only  if 


Q'SQ=  (3.12) 

since  (Q'SQ)^  =  suE  j.  As  such, 

Rp[P^Q'SQPi]=Rp[  "*£  €GLt(2).(3.13) 

showing  that  the  feedback  system  with  compensator 
QRpQ'S  is  stable. 

Conversely,  if  R  c  is  a  stabilizing  compensator  for  the 
feedback  system, 

[RpPx+Q'RcQP2]EGLk(  2).  (3.14) 

Moreover,  since  Q  and  Rp  are  in  GLk( 2),  we  may,  without 
loss  of  generality,  assume  that  Rc  is  of  the  form 


[*,/>,  +  Q'RcQP2]eGL>(2). 


Since  [RpPx  +  Q'RcQPi]  has  entries  in  h,  (3.8)  implies  that 
its  inverse  exists  and  also  has  entries  in  h.  As  such,  a 
feedback  system  will  be  stable  if  and  only  if  the  relation¬ 
ship  between  its  input  vector  col(/i:,>i,)  and  its  partial 
state  vector  col(o^,oe)  is  stable.  With  the  aid  of  (3.3)  and 
(3.4),  this,  in  turn,  implies  that  the  relationship  between  the 
system  input  vector  and  all  of  its  internal  variables  is 
stable,  and  the  converse  is  also  true  [9]. 

With  these  preliminaries,  we  now  proceed  with  our  first 
theorem  in  which  a  geometric  characterization  of  the  set  of 
stabilizing  compensators  for  a  given  plant  is  obtained.  To 
this  end,  recall  that  5  is  the  subset  of  GLk(2)  correspond¬ 
ing  to  the  stable  systems  and  [5]  is  its  image  in  G*(1.2), 
while  any  TEGLt( 2)  defines  a  transformation  on  [S]C 
G*(1.2)  via 


Rc  =  QRpQ'X. 
in  which  case  it  suffices  to  show  that 
X  =  QR;'Q'RCES. 
To  this  end,  we  observe  that 


(3.15) 


(3.16) 


+  Q'R,QP2]  =  [RpPx  -r  Q'(QRpQ'X)QP:] 
=  Rp[p^Q‘XQP:] 


X'XQ  =  _ 


1  “  *:i  ] 


“  -*:i 

-X -■> 


(3.1") 


(3.18) 


» 
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that 


Now,  since  [RpPl  +  Q'RcQPz]EGLh(2), 

0  “**j  =R;'[RPP^Q'RcQP^GLt(l), 

(3.19) 

’  which  implies  that  x , ,  €  j  as  required  to  verify  that  XES. 
As  such,  an  arbitrary  stabilizing  compensator  for  out  sys¬ 
tem  is  of  the  form  Re  —  QRpQ'X. 

The  set  of  all  representations  of  all  stabilizing  compensa¬ 
tors  in  GL„( 2)  thus  takes  the  form  QRpQ‘S  C  GL„(2). 
Now,  upon  mapping  this  set  into  the  Grassmannian,  we 
obtain 

-  (3-20) 

completing  the  proof.  ■ 

~  Unlike  the  previous  results  given  directly  in  terms  of  the 
—fractional  representation  theory  (9),  (24)  which  art  local  in 
nature,  the  present  theorem  yields  a  global  description  of 
the  set  of  stabilizing  compensators  for  a  given  plant.  In¬ 
deed.  the  required  set  is  just  a  copy  of  the  stable  system  in 
the  Grassmannian  transformed  by  the  action  of  QRpQ'. 

It  is  interesting  to  compare  the  parameterization  of  the 
stabilizing  compensators  of  the  theorem  with  that  obtained 
directly  from  the  fractional  representation  theory.  Indeed, 
a  little  algebra  with  the  results  of  [4]  will  yield  the  equality 

R<  =  QRfQ'W\  WEfr  (3.21) 

for  the  family  of  stabilizing  compensators.  Recalling,  how¬ 
ever,  that  5  =  HfE,  this  parameterization  differs  from  that 
of  the  theorem  only  in  that  the  equivalence  transformation 
£  has  been  deleted.  As  such,  the  set  of  stabilizing  com¬ 
pensators  of  (3.21)  includes  exactly  one  representation  for 
each  stabilizing  compensator  rather  than  parameterizing  all 
representations  for  the  stabilizing  compensators  as  is  the 
case  with  the  present  theorem.  Of  course, 

[QRJ>Q,ffj=QRQ,[f*'}  =  QRQ'[Sl  (3.22) 

showing  that  the  same  set  of  compensators  in  the  Grass¬ 
mannian  is  defined  by  the  two  theories. 

It  follows  immediately  from  the  theorem  that  a  stabiliz¬ 
ing  compensator  always  exists  given  that  the  plant  is 
modeled  by  an  RpEGLk( 2).  In  practice,  however,  one 
often  requires  that  the  compensator  be  of  a  special  form: 
'.table,  memoryless,  diagonal,  etc.  As  such,  if  [C]  represents 
he  desired  class  of  compensators  in  (7k(1.2)Jt  is  necessary 
■  Jesign  a  compensator  in  [C]C\QR  pQ'\S].  >n  which  case 
■,  ooi  clear  that  such  a  compensator  even  exists.  In  the 
*-.ere  [C]  represents  the  stable  systems,  i.e..  we  desire 
-J  .  stable  stabilizing  compensator  [25],  a  simple 
-•  -  v  r  -  he  existence  of  the  required  compensator  can, 
obtained  as  follows. 

^  plant  represented  by  RpE  GLh( 2)  can  be 
-  *  able  compensator  if  and  only  if 

Rres~s. 


Proof:  If  RpES~  '5,  there  exist  S,  and  S:  in  £  such 

Rp  =  sr's{.  (3.23) 

Now  let 

Rc  =  QSflQ'  (3.24) 

which  is  stable  since  the  congruence  transformation  de¬ 
fined  by  Q  maps  S~ 1  to  5  and  conversely.  Moreover,  since 
Q'  =  Q~\ 

RC=QST  {Q'  =  GSf  'Q'QSyQ'QS; ' Q ' 

^iQST'S&HQSr'Q1)  *  QRfQ'S  (325) 

where  S  =  QS,~,Q‘ES.  As  such,  we  have  constructed  a 
stable  stabilizing  compensator  for  Rp  as  required. 

Conversely,  if  RCES  is  a  stable  stabilizing  compensator 
for  R?,  then  by  the  theorem,  Rc  =  QRpQ'S  for  some  SE  S. 
As  such, 

Rp  =  Q'R'S~  'Q  -  Q'R'QQ'S-  'Q 

=  (C'^e)(C'5-,C)  =  ^'!Sl  (3.26) 

since  the  congruence  transformation  defined  by  Q  maps 
RfSS  to  Sj-'ES-1  and  S_1e£~' to  S,S5.  ■ 

Of  course,  when  the  hypotheses  of  the  corollary  held,  the 
set  of  stable  stabilizing  compensators  for  Rp  —  is 
nonvoid  and  given  by  {^]n^R^g'[5).  No  explicit  para¬ 
meterization  for  this  set  is,  however,  known  nor  is  it 
obvious  that  one  even  exists.  This  intuition  will  be  explored 
further  in  the  following  section  on  the  simultaneous  stabili¬ 
zation  problem. 

IV.  Simultaneous  Stabilization 

The  key  to  our  solution  of  the  simultaneous  stabilization 
problem  lies  with  a  reversal  of  the  analysis  used  in  the 
derivation  of  the  stabilization  theorem.  That  is,  one  as¬ 
sumes  that  RCE  GLP(2)  is  given  and  parameterizes  the  set 
of  plants  which  are  stabilized  by  Re.  Denoting  that  set  of 
plants  by  Rp(Rc\  we  obtain  the  following  theorem.  Since 
the  proof  for  the  theorem  is  virtually  identical  to  that 
already  given  for  the  stabilization  theorem  of  the  previous 
section,  it  will  not  be  repeated  here. 

Theorem:  R/(RC)  =  QRCQ,(5). 

Since  every  TE  GLk(2)  is  of  the  form  T  =  QReQ'  (with 
RC  =  Q'TQ),  the  lemma  implies  that  a  set  of  plants  PC 
Gh( ; ,  2)  can  be  simultaneously  stabilized  if  and  only  if  they 
lie  in  a  copy  of  the  stable  systems  [5)  transformed  by  the 
action  of  some  TEGLJfl).  Indeed,  this  is  the  desired 
generalization  of  the  first-order  example  given  in  the  intro¬ 
duction  to  the  case  of  a  general  linear  system.  In  the 
general  case,  the  Grassmannian  piavs  the  role  of  the  A-B 
plane,  the  stables.  [5],  play  the  role  of  the  half-plane,  and 
the  general  linear  group,  (7£*(2),  serves  as  the  group  of 
“rotations."  These  observations  are  summarized  in  the 
following  corollary. 

Corollary  2:  A  set  of  plants  P  C  G»(1.2»  can  be  simulta- 
neousiv  stabilized  if  and  onlv  if  thev  lie  in  7"[S)  for  some 
TEGLk(l.2). 
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Although  the  corollary  represents  a  complete  geometric 
solution  to  the  simultaneous  stabilization  problem,  it  is  not 
amenable  to  convenient  implementation.  We  therefore  give 
an  alternative  algebraic  solution  to  the  problem  in  GLh(2). 

Corollary  3:  Let  RpEGLk( 2),  pEP  be  representations 
for  a  set  of  plants  P  C  <7*0,2).  Then  P  can  be  simulta¬ 
neously  stabilized  if  and  only  if  there  exists  a  family  of 
matrices  SpES,  pEP  such  that 


As  such,  Rc  stabilizes  the  zero  plant  if  and  only  if  dcrE  j 
or.  equivalently,  R(ES.  The  problem  of  stabilizing  p  with 
a  stable  compensator  is  thus  equivalent  to  simultaneously 
stabilizing  p  and  the  zero  plant.  Since  the  zero  plant  is 
represented  by  1 E  GLJ2),  it  follows  from  (4.6)  that  p  can 
be  stabilized  by  a  stable  compensator  if  and  only  if 

*,  =  (1)"*,  =  $“5  (4.9) 


s-'s,  =  p,qEP. 

Proof:  If  Rc  simultaneously  stabilizes  Rp,  pEP,  then 
it  follows  from  Corollary  2  that  there  exist  SpES,  pE  P 
such  that 


Rp  =  QRcQ'Sp;  pEP.  (4.1) 

Hence, 

RJ'R<  =  {QR'Q,Sfy'{QRcQ,S,)  =  Sp-'S,.  (4.2) 

Conversely  if  there  exists  SpES,  pE  P  such  that  Rf'Rp  = 
S~'Sg-,  p,qEP,  then  we  let  p’  be  an  arbitrary  plant  in  P 
and  define  Rc  via 

Re  =  Q'RpS;'Q.  (4.3) 


Rt  is  independent  of  the  choice  of  p‘.  Indeed,  if  p"  is  an 
’alternative  choice,  then  since  R~'Rp-  =  S~-xSp~, 

Rp.Sp~'  =  Rp,(R-'Rp.SpZ')  =  RrSp~J.  (4.4) 

Moreover,  for  any  pE  P,  Rp-'Rp  —  5^"  'Sp:  hence, 

Rp  =  RpSp~.  'Sp  =  QQ'RP  S;.  'QQ'Sp 

=  Q{Q'R„SpZ  'Q)Q'Sp  =  QRcQ‘Sp  (4.5) 

from  which  it  follows,  via  the  corollary,  that  the  set  of 
plants  P  C  Gk(  1 , 2)  is  simultaneously  stabilized  by  Rc.  ■ 
It  is  interesting  to  note  that  in  the  special  case  in  which 
P  contains  exactly  two  plants,  say  p  and  q,  then  they  are 
simultaneously  stabilizable  if  and  only  if 

Rf'R'ES-'S  (4.6) 


which  is  identical  to  the  criterion  of  Corollary  1  to  stabilize 
a  plant  with  a  stable  compensator.  Indeed,  this  should  not 
be  surprising  since  the  problem  of  stabilizing  a  plant  with  a 
stable  compensator  is  completely  equivalent  to  the  problem 
of  simultaneously  stabilizing  the  given  plant  and  the  zero 
plant  (represented  by  R.  =  le  GLk(l)).  This  follows  im¬ 
mediately  from  the  definition  of  stability  used  in  (4]  or, 
alternatively,  one  may  let  R.  =  1  and 


R  = 


—  u. 


(4.7) 


in  which  case 


Q'R<QP J 


(4.8) 
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which  replicates  the  result  of  Corollary  3. 

Of  course,  a  similar  argument  can  be  used  to  obtain  an 
algebraic  criterion  for  the  simultaneous  stabilization  6f  a 
set  of  plants  P  by  a  stable  compensator.  To  this  end,  we 
simply  augment  the  set  of  plants  P  by  the  zero  plant  and 
then 'apply  the  theorem  to  /,,  =  {0)U/>.  Alternatively,  a 
geometric  criterion  for  the  simultaneous  stabilization  of  P 
may  be  obtained  by  requiring  that  />cr[S]  for  some 
TE  S~\  Since  the  corresponding  Rc  is  given  by 

RC  =  Q'TQ,  (4.10) 

the  resultant  compensator  will  be  stable. 

Finally,  the  possibility  of  parameterizing  the  set  of  com¬ 
pensators  (or  stable  compensators)  which  simultaneously 
stabilize  P  should  be  considered.  In  effect,  this  amounts  to 
parameterizing  the  set  of  2X2  matrices  TEGL^l)  or 
TES~'  such  that  Fcr[^]  which,  in  turn,  requires  some 
kind  of  parameterization  for  P.  In  particular,  if  /*  =  7'[5'], 
then  R'  —  Q'TQ  is  the  unique  stabilizing  compensator, 
while  the  stabilizing  compensators  for  a  single  plant  p  may 
be  parameterized  by  [5]  as  per  the  stabilization  theorem  of 
Section  III. 


V.  Examples 

Since  the  action  of  GLk( 2)  on  (7*0,2)  is  a  geometric 
invariant,  one  can,  at  least  intuitively,  say  that  the  “shape" 
of  T[5]  is  identical  to  that  of  [5],  and  every  set  whose 
"shape"  is  the  same  as  [5]  may  be  obtained  from  (5)  by 
such  a  transformation.  As  such,  Corollary  4  implies  that  a 
prescribed  set  of  plants  P  C  (7*(1,2)  admits  a  simultaneous 
stabilization  if  and  only  if  P  is  contained  in  a  subset  of  the 
Grassmannian  whose  “shape”  is  the  same  as  (S).  For 
instance,  in  the  example  of  Fig.  1,  P  must  be  contained  in 
an  appropriate  half-plane. 

Although  well-defined  coordinate  systems  do  exist  in  the 
Grassmannian,  at  the  time  of  this  writing  we  have  yet  to 
formulate  a  computational  algorithm  for  implementing  the 
above-described  simultaneous  stabilization  problem  in  any 
degree  of  generality.  Interestingly,  however,  in  the  case 
where  P  is  composed  of  exactly  two  plants  (a  two-speed 
motor  or  the  dynamics  of  a  swing-wing  aircraft),  a  simple 
frequency  domain  criterion  for  the  simultaneous  stabili¬ 
zation  problem  was  given  by  Chua  [8],  (16)  and  the  authors 
in  the  singie-vanate  case,  and  has  since  been  extended  to 
the  multivariate  case  by  Vidvasagar  and  Vjswanadham 
[19). 

To  illustrate  the  above-described  general  geometric  crite¬ 
rion  in  a  computationally  trackable  setting,  consider  the 


case  of  a  set  of  third-order  smgie-vanate  plants 


_  which  are  to  be  stabilized  by  a  proportional  compensator 
c(j)  =  i.  To  represent  the  class  of  plants  geometrically,  we 
identify  the  six-dimensional  space  of  such  plants  with  a 
family  of  three-dimensional  Euclidian  spaces  (with  coordi¬ 
nates  Po'P\-Pz)  parameterized  by  the  numerator  coeffi¬ 
cients  q0,  qx,  and  qv  As  such,  we  have  a  three-dimensional 
family  of  three-dimensional  spaces  ,,r  Of  course,  if 

there  is  no  feedback  (t-  0),  such  a  system  will  be  stable  if 
^  and  only  if 

Po’P2>0  (5-2) 


-  and 

Pi>Po/Pi  -  (5*3) 

independently  of  the  numerator  parameters.  The  resultant 
stability  region  for  the  t  —  0  case  is  thus  as  illustrated  in 
Fig-  3(a). 

In  the  case  of  a  nonzero  compensator,  a  similar  argu¬ 
ment  with  the  Hurwitz  criterion  will  yield  the  set  of 
inequalities 

Po  +  “l  o>0  (5*4) 

p;  +  tq2>0  (5.5) 


and 

Pi>(Po  +  t<io)/(P2  +  “h)-  (5*6) 

As  such,  for  a  fixed  value  of  the  numerator  coefficients  <?0, 
<7,.  and  qz.  the  new  stability  region  is  identical  in  “shape” 
to  the  t  *  0  case  except  for  a  shift  of  the  origin  to  the  point 
ltq0,tq{jqz)  in  as  illustrated  in  Fig.  3(b).  Of 

course,  as  t  changes,  the  origin  of  the  stability  region  moves 
along  the  line  determined  by  the  point  (q0,<7,,o2).  Unlike 
the  t  =  0  case,  however,  where  the  stability  region  is  inde¬ 
pendent  of  the  numerator  parameters,  the  line  along  which 
the  origin  of  the  stability  region  moves  is  determined  by 
the  numerators  parameters.  Thus,  in  this  example,  our 
three-dimensional  family  of  three-dimensional  Euclidian 
spaces  Afi'.f'.f.i  plays  the  role  of  the  Grassmannian .  with 
the  region  defined  by  (5.2)  and  (5.3)  in  each  such  space 
characterizing  the  stable  systems.  Furthermore,  the  reai 
group  (corresponding  to  the  proportional  compensators) 
acts  on  this  space  by  translating  the  stability  region  in  the 
-  space  j)  along  a  line  determined  by  the  point 

•  Although  the  above  example  was  derived  from  basic 
principles,  we  believe  that  it  illustrates  the  essential  geo¬ 
metric  nature  of  the  simultaneous  stabilization  problem  as 
formulated  in  our  abstract  theory.  Indeed,  a  prescribed  set 
of  plants  car.  be  simultaneously  stabilized  if  and  oniv  if 
they  are  contained  in  a  translate  of  the  stables. 


(b) 

Fig.  3. 


VI.  Conclusions 

Our  purpose  in  the  preceding  has  been  threefold.  First, 
we  have  attempted  to  exhibit  the  essential  relationship 
between  the  fractional  representation  theory  and  the  alge- 
bro- geometric  approach  to  system  theory.  Second,  we  have 
presented  a  global  solution  tc  the  feedback  system  sta¬ 
bilization  problem.  Third,  a  solution  to  the  simultaneous 
stabilization  problem  has  been  presented.  It  should,  how¬ 
ever,  be  pointed  out  that  the  solution  presented  for  the 
simultaneous  stabilization  problem  is  mathematical  in  na¬ 
ture  and  not  intended  for  computational  implementation. 
At  the  present  time,  no  computationally  feasible  solution 
to  the  simultaneous  stabilization  problem  is  known,  except 
in  the  case  where  P  contains  exactly  two  plants  wherein  a 
simple  frequency  domain  test  is  possible  [8],  [16]  and  in  the 
simple  low  dimensional  illustrated  herein. 

References 

H]  ?  I  Antsaklis  and  B  Pearson.  ”  Stabilization  and  regulation  in 
linear  muimanable  systems."  iEEE  Trans  Autortai  ionir.  veil. 
AC-:?,  pp  92S-V? 0.  !9"S 

G  Benguson,  "  Output  regulation  and  international  modes  — A 


20 


SA£K5  AND  MURRAY:  simultaneous  STABILIZATION  problem 


freauencv  domain  approach  "  Automatica.  vol.  13.  pp  333-345. 
I  977. 

[3]  F.  Brickell  and  R.  S.  Qark.  Differentiable  Manifolds  London. 
England  Van  Nosirand  Remhold.  1970. 

[4]  R.W.  Brocken  and  C  I  Byrnes.  “Multivanable  Nyquist  criiena, 
root  locus,  and  pole  placemen::  A  geometric  viewpoint."  IEEE 
Trans.  Automat.  Contr.,  to  be  published. 

[5]  F.  M.  Callier  and  C.  A.  Desoer.  “Stabilization,  tracking  and  dis¬ 
turbance  rejection  in  linear  multivariable  distributed  systems."  in 
Proc  1 7th  IEEE  Conf.  Decision  Contr .  San  Diego,  CA.  Jan.  1979. 
pp.  513-514;  also.  Univ.  California,  Berkeley.  Tech.  Memo 
UCB/ERL  M78/83.  Dec.  1978. 

[6]  L.  Cheng  and  J.  B.  Pearson,  “Frequency  domain  synthesis  of 
multivariable  linear  regulators."  IEEE  Trans.  Automat.  Contr.,  vol. 
AC-23,  pp.  3-15,  1978. 

[7]  - ,  “Synthesis  of  linear  multivariable  regulators,"  IEEE  Trans. 

Automat.  Contr.,  to  be  published. 

[8]  O.  Chua,  M.S.  thesis,  Texas  Tab  Univ.,  Lubbock.  1980. 

[9]  C.  A.  Desoer.  R.-W.  Liu.  J.  Murray,  and  R.  Saeks,  “Feedback 
system  design:  The  fractional  representation  approach  to  analysis 
and  synthesis,”  IEEE  Trans.  Automat.  Contr.,  voL  AC-25,  pp. 
399-412.  1980. 

[10)  B.  Francis  and  M.  Vidyasagar,  “Algebraic  and  topological  aspects 
of  the  servo  problem  for  lumped  linear  systems,”  unpublished  notes, 
Yale  Univ.,  New  Haven.  CT,  1980 

—  [11]  R.  Hermann  and  C.  Martin,  “Applications  of  algebraic  geometry  to 
linear  system  theory,”  IEEE  Trans.  Automat.  Contr.,  vol.  AC-22, 
pp.  19-25,  1977. 

[12]  - ,  “Applications  of  algebraic  geometry  to  system  theory:  The 

MacMillan  degree  and  Kronecker  indices' as  topological  and  holo- 
morphic  invariants,"  SIAM  J.  Contr.,  vol.  16,  pp.  743-755.  1978. 

[13]  L.  Peraebo,  Ph.D.  dissertation,  Lund  Inst.  Technol.,  Lund.  Sweden, 
1978. 

[14]  - ,  “An  algebraic  theory  for  design  of  controllers  for  linear 

multivariable  systems:  Pans  I  and  II,**  IEEE  Trans.  Automat. 
Contr..  to  be  published. 

[15]  R.  Saeks,  Generalized  Networks.  New  York:  Holt,  Rinehart  and 
Winston,  1972. 

[16]  R.  Saeks.  J.  J.  Murray,  O.  Chua.  and  C.  Karmokoiias,  “Feedback 
svstem  design:  The  single  variate  case,”  unpublished  notes,  Texas 
tech  Univ.,  Lubbock,  1 980. 

[17]  R_  Saeks  and  J.  Murray,  “Feedback  systems  design:  The  tracking 
and  disturbance  rejection  problems."  IEEE  Trans.  Automat.  Contr., 
voL  AC-26,  pp.  203-217,  1981. 

[18]  M.  Vidyasagar.  H.  Schneider,  and  B.  Francis.  “Algebraic  and 
topological  aspects  of  feedback  system  stabilization,"  Dep.  Elec. 
Eng..  Univ.  Waterloo.  Waterloo,  OnL,  Canada,  Tech.  Rep.  804)9, 
1980. 

[19]  M.  Vidyasagar  and  M.  Viswanadham.  “Algebraic  design  techniques 
(or  reliable  stabilization.”  Dep.  Bee.  Eng.,  Univ.  Waterloo, 
Waterloo.  Ont.,  Canada.  Tech.  Rep.  81-02,  1980. 

[20]  W.  A.  Wolovich,  “Multivariable  system  synthesis  with  step  dis¬ 
turbance  rejection."  IEEE  Trans.  Automat.  Contr.,  vol.  AC- 19,  pp. 
127-130,  1974. 


[21]  W.  A.  Wolovich  and  P.  Ferreira.  “Output  regulation  and  tracking  in 
linear  muluvanable  svstems."  IEEE  Trans.  Automat.  Contr..  vol. 
AC-24,  pp.  460-465,  1979. 

[22]  D.  C.  Youla.  “lnierpolary  multichannel  spectral  estimauon."  un¬ 
published  notes.  Polytechnic  Inst  New  York.  Brooklyn,  1979 

[23]  D  C  Youla.  J  J.  Bongiomo.  and  H.  A.  Jabr,  “Modem  Wiener-Hopf 
design  of  optimal  controllers.  Pan  I."  IEEE  Trans.  Automat.  Contr.. 
vol.  AC-21,  pp  3-15.  1976. 

[24]  —  "Modem  Wiener-Hopf  design  of  optimal  controllers.  Part  II," 
IEEE  Trans.  Automat.  Contr..  vol  AC-21,  pp.  319-338,  1976. 

[25]  D.  C.  Youla,  J.  J.  Bongiomo.  and  C.  N.  Lu.  “Single-loop  feedback 
stabilization  of  linear  multivanable  dynamic  plants,"  Automatica, 
vol.  10,  pp.  159-173,  1974. 

[26]  G.  Zames.  “Feedback  and  optimal  sensitivity:  Model  reference 
transformation,  weighted  seminorms,  and  approximate  inverses.” 
IEEE  Trans.  Automat.  Contr.,  to  be  published. 


Richard  Saeks  (S,59-M*65-SM*74-F^7)  was 
bom  in  Chicago,  IL,  in  1941.  He  received  the 
B.S.  degree  in  1964,  the  M.S.  degree  in  1965,  and 
the  Ph.D.  degree  in  1967  from  Northwestern 
University.  Evanston,  IL,  Colorado  State  Univer¬ 
sity,  Fort  Collins,  and  Cornell  University.  Ithaca, 
NY,  respectively,  all  in  electrical  engineering. 

He  is  presently  Paul  Whitfield  Horn  Professor 
of  Electrical  Engineering,  Mathematics,  and 
Computer  Science,  Texas  Tech  University,  Lub¬ 
bock.  where  be  is  involved  in  leaching  and  re¬ 
search  in  the  areas  of  fault  analysis,  large-scale  systems,  and  mathematical 
system  theory. 

Dr.  Saeks  is  a  member  of  AMS,  SIAM,  ASEE,  and  Sigma  Xi. 


John  Murray  (M*78)  was  bom  in  Galway.  Ire¬ 
land.  on  August  8,  1947.  He  received  the  B.Sc. 
and  M.Sc.  degrees  from  University  College,  Cork, 
Ireland,  in  1969  and  1970.  respectively,  and  the 
Ph.D.  degree  from  the  University  of  Notre  Dame. 
Notre  Dame.  IN,  in  1974,  all  in  mathematics. 

He  is  currently  with  the  Department  of  Electri¬ 
cal  Engineering.  Texas  Tech  University,  Lub¬ 
bock.  His  principal  research  interests  are  in  the 
areas  of  several  complex  variables,  multidimen¬ 
sional  system  theory,  and  time-varying  systems. 


9> 


•> 


31 


|e 

,N 


THEORY  OF  DESIGN  USING  NONLINEAR  TRANSFORMATIONS 

Rcnjeng  Su,*  L.  R.  Hunt,-1-  and  George  Meyer* 

NASA  Ames  Research  Center 
Moffett  Field,  California  94035 


ABSTRACT 

This  paper  is  presenting  an  overview  of  the 
theory  of  transformations  from  nonlinear  systems 
to  linear  systems.  Topics  covered  include  (1) 
necessary  and  sufficient  conditions  for  transfor¬ 
mations  to  exist.  (2)  a  method  of  constructing 
transformations,  (3)  robustness  in  design  (based 
on  transformation  theory)  and  Lyapunov  functions, 
(4)  estimation  theory,  and  (S)  the  relationship 
between  transformation  theory  and  "nonlinear 
teroes."  Application  of  these  Tesults  to  auto¬ 
matic  flight  control  is  presented  in  another  paper 
at  this  session. 


I .  INTRODUCTION 

Suppose  we  have  a  system  that  is  nonlinear. 
If  there  is  a  set  of  coordinates  (in  state  and 
control  space)  for  which  this  system  appears  as  a 
linear  system,  then  the  nonlinearity  is  only  the 
result  of  an  unfortunate  choice  of  coordinates. 
This  leads  us  to  the  problem  of  classifying  all 
nonlinear  systems  that  can  be  transformed  to  con¬ 
trollable  linear  systems. 

The  systems  of  interest  to  us  are 


x(t)  •  f(x(t))  ♦  £  u.(t)g.(x(t))  .  (1) 

i-i*  1  1 

where  f.Sj  .g2»  •  •  •  >2m  *re  vector  fields  on 

Kn  [i.e.,  each  assigns  an  n  vector  at  points  of 
Bln]  and  f(0)  •  0.  We  present  necessary  and  suf¬ 
ficient  conditions  foT  these  systems  tc  be  trans¬ 
formable  to  controllable  linear  systems  in 
Brunovsky1  canonical  fora.  These  conditions  de¬ 
pend  on  Lie  brackets  from  the  field  of  differential 
geometry. 

Given  a  transformable  system,  we  mention  a 
method  for  constructing  a  transformation  to  a 
linear  system.  This  method  depends  on  solving 
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systems  of  ordinary  differential  equations.  Sev¬ 
eral-techniques  for  obtaining  exact  and  approxi¬ 
mate  transfoxmations  are  noted. 

Once  a  nonlinear  system  is  mapped  to  a  linear 
system,  the  known  theory  for  design  of  linear  con¬ 
trol  systems  can  be  used  to  design  for  the  non¬ 
linear  system.  We  show  that  this  technique  is 
robust.  In  terms  of  stability,  Lyapunov  functions 
for  the  linear  system  are  composed  with  the  trans¬ 
formation  to  provide  Lyapunov  functions  for  the 
nonlinear  system. 

Brief  comments  are  given  showing  the  rela¬ 
tionships  between  transformation  theory  and  esti¬ 
mation  and  ^between  transformation  theory  and 
"nonlinear  "reroes . " 

For  other  work  on  transformations  we  refer  to 
the  research  of  Kroner,2  Brockett,5  Jakubciyk  and 
Respondek,1*  Hermann. 5  and  the  results  in  Refs. 

6-12. 

II.  TRANSFORMATIONS 

If  we  have  a  controllable  linear  system  with 
a  inputs 

y  •  Ay  *  Bu  (2) 

then  Brunovsky1  proved  that  it  could  be  put  in  the 
canonical  form 

9  •  ,  (3) 

which  is  based  on  the  Kronecker  indices 

»*2. •  •  •  with  *  ...  *  e*  ■  n  and 

<!  >  <j  >  ...  >_  <B.  We  let  oj  ■  <1,02  •  <i  ♦  <2, 

. .  "o„,  ■  <i  *  *  •••  ♦  <m  *  n*  For  a  complete 

description  of  Eq.  (3)  see  Ref.  12. 

We  find  necessary  and  sufficient  conditions 
to  map  our  nonlinear  system  (1)  to  the  canonical 
linear  system  (3).  We  consider  transformations 
T  ■  (T] ,T2 , ■ . . ,Tn ,Tn< j , . . . , i n-m)  from 

®n*m((xi,X2 . x„,ui  ,u2, . . .  ,uB)  space)  to 

mn*m  ( (yj  (j,j . yn.vi  ,v2  , . .  .v„)  space)  which  apply 

foT  all  (X],X2 . *n)  *ome  °Pen  neighborhood  of 

(0,0 . 0)  in  Rn  and  which  map  the  origin  in 

x-soace  to  the  origin  in  /-space.  Of  course  we 
want  such  transformations  to  be  diffeomorphisms 
(V",Sf*  inverse,  nonsingular,  and  one-one)  and 
take  system  (1)  to  the  system  (3).  The  last  m 
coordinate  functions  Tn»] ,7n*2 . • • • are  the 
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only  ones  that  depend  on  the  controls  u, ,u2. 


Given  *4  “  vector  fields  f  and  g  on  R°  we 
define  the  Lie  bracket  of  f  and  g- 

where  ^  and  ^  are  Jacobian  matrices.  Similarly 
we  take4*  " 

Ca4°f.*J  -  g 

(adlf.g3  .  [f,g] 

CadJf.*J  •  [f.[f.*33  • 


(adkf.g)  -  [f.(ad*-lf,g)]  . 


III.  CONSTRUCTION  OF  TRANSFORMA ITONS 

We  denote  by  <-..}  the  duality  between  one- 
forms  and  vector  fields:  that  is,  if  h  is  a  Vm 
function  and  f  a  V*  vector  field 

It  is  shown  in  Ref.  11  that  under  the  assump¬ 
tions  of  Theorem  1.  we  have  a  desired  trmasfctma- 
tion  fTj  ,Tj  #  •  • .  ,T|^,Tj»*  j .  #T||*^3  if  and  only  if  w< 
can  solve  the  following  partial  differential  equa¬ 
tions 

(dTlt  (adjf.fi))  «  0,1  •  0.1 . *1-2 

and  i  •  1,2,.. ..a 

(dT^j.CadJf.g^)  -  O.J  -  0.1 . *t-2 

and  i  »  l,2,...,m 


A  set  of  V"  vector  fields  {fi,f-,...,fT} 
on  *n  is  involutlve  if  there  exist  func¬ 
tions  rijkisl  so  that 

r 

[fi.fjjCx)  •  £  Tiik(x)  fk  (X).  1  <  i.s  *  r.i  i  J  . 

Me  define  the  sets 

C  ■  |f,,[f,Sj],...,(ad  1  f.gj) ,gj» (f,g2l «• 

. . . , (ad  *  f ,12) , • • • rle, [f >g«3  * 
...,£adK"”lf,g«j| 

•  |gi ,  [f.g ; (ad  i  *f,g|)  .gj.tf.gj] , 

•  e#v(td  ^  •  *  *  *  »£•*  • 

...,(ad<j**f,gm)|  for  i  •  1,2,  ...  a 

The  following  result  from  Ref.  11  gives  condi¬ 
tions  under  which  a  transformation  of  the  type  we 
consider  exists. 

Theorem  1  The  system  (1)  is  transformable  to 
the  system  (j) ,  where  the  state  variables 
xi,X2,...,xn  are  contained  in  some  open  neighbor¬ 
hood  V  of  the  origin  In  Rn  ,  if  and  only  if 

1.  The  set  C  spans  an  n-diaensional  space 
at  each  point  of  V 

.  2.  The  sets  C<  are  involutive  for 

)  •  1,2, ....a 

3.  The  span  of  each  Cj  is  equal  to  the 

span  of  C^C 

j 

An  obvious  problem  is  how  to  construct  a 
transformation  if  the  hypotheses  of  this  theorem 
ere  satisfied. 


<*TcB.1.!.Cadjf.gi))  -  O.j  -  0.1 . k„-2  (4) 


and  i  •  1,2,. ...a 


<*T0l.f>  *  Z»l(dTi,(i4,1'1f1ii))  - 

<«•-.£>  *  f:>*i<<«Tai*l.(adej‘lf.gi)>  .  Tn<J 
i*i 

<dTB,f>  r  £ .ri.(^"'1f.gi)>  -  V, 
i»l 

whore  the  plus  sign  Is  for  <<  odd  and  the  minus 

sign  is  for  even,  i  ■  1,2 . a.  Me  also  want 

the  matrix 

<dTl.(ad,‘1'lf,gl)>...(dT1,(ad'tl*lf,gB3> 
<dToj-i .  (•d<J”1f,gl)>  . . .  (dT0l.j .  (ad'^f.g,)) 


I 

•  (•d‘**1f  .g,))  . . .  (dT,^., ,  (ad"*" 1  f , g^jJ 

to  be  nonsingular.  The  other  coordinate  functions 
of  the  transformation  are  found  by  differentiation: 

T2  -  Tj  -  4^,Tj  .  t2 . Tei.2  -  f0].j,  etc. 

Note  that  the  last  m  equations  in  (4)  hold, 
once  we  find  Ts  ,T0^«i , . . .  ,T0l1u.  ,L  satisfying  the 
first  sets  of  equations.  These' last  a  equations 

allow  us  to  solve  for  uj.u- . a-,  In  terms  of 

Tn-i>”n-2 . ”n*fl  (these  are  the  same  as 


/ 
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vj ,vj, . . .,vB) .  This  fact  is  extremely  important 
in  practice. 


Tl  *  *Pi  "  5 


To  find  solutions  Tj ,79.*j . . . . ,T»  _  of 
(4)  we  aove  froa  partial  differential  equations  to 
ordinary  differential  equations,  as  in  Ref.  11. 

We  order  the  n  vector  fields  in  C  as  follows. 
Let 

X!  .  (ad*1'^ ,gi) 


(ad  1  f.g2)  if  (ad  3"  f,g2)«C 


*  \j(ad  1  f.tl)if(adl  f,g2)<C 

f  <1*1  *,-l 

(ad  f.gj)  if  (ad  1  f.g3)«C 

(ad*l”lf,gjJ  if  (ad^f.gj^Cand  (ad*1'^ ,g2)«C 

(ad^f.gj)  if  (ad<tl”1f,g2)d  Cand  (ad<1'  f,g2)<C 

(ad*i**f,gj} if  (ad'Crlf,g2_)dCand  (ad^'f.g^dC 


V  «m  • 

We  than  examine  a  sequence  of  ordinary  dif¬ 
ferential  equations.  Real  parameters  s1(S2, 

. ...sn  are  introduced  by  solving  in  order 

Xj  with  x(0)  »  0 

X,  with  x(Sj  ,0)  •  x(Sj) 
dx 

X3  with  x(Sj.s2.0)  •  x(Sj  ,s2) 


3J-*  Xnwithx(Sj,s2 . sn.!,0)  -  x(sj,s2 . sn_2) . 

n 

For  i  •  1,2 . a  we  let  p,  equal  the  sub¬ 

script  of  that  X.  which  is  equaPto  (ad  ''‘f.gj). 
Then  we  let 


T0;*’.  ■  S?; 


T°a-I*i  "  *Pa 

By  the  inverse  function  theorem  we  can  solve 

for  si,S2,...,S|i  as  functions  of  xj,x2 . x„ 

[because  aatrix  (S)  is  nonsingular] ,  and  hence  we 
know  Ti,Tfl j*x»...,Tom-i*l.  Examples  of  this  pro-, 
cess  afe  given  in  Refs.  10-12. 

Henry  Ford  (a  graduate  student  of  the  second 
author)  is  currently  using  the  MIT  MACSYMA  program 
to  construct  such  transformations  and  is  also  inves¬ 
tigating  numerical  aethods.  Complete  solutions  for 
block  triangular  systems  are  given  in  Refs.  6  and  7. 
An  approximation  technique  using  the  concept  of  the 
tangent  model  is  presented  in  Ref.  8.  This  con¬ 
struction  is  useful  in  the  design  of  an  automatic 
flight  controller  for  a  helicopter.  The  tangent 
model  gives  us  the  linearization  of  the  traits  forma¬ 
tion  about  a  point  (without  having  the  actual  trans¬ 
formation)  modulo  constants 

IV.  ROBUSTNESS 

Our  Eq.  (1)  is  a  mathematical  model  for  some 
physical  plant  that  we  wish  to  control.  Suppose 
we  are  interested  in  the  problem  of  stabilization. 

If  we  choose  controls  to  asymptotically  stabilize 
the  model  (1)  about  the  origin,  do  these  controls, 
when  applied  to  nearby  nonlinear  systems,  asymp¬ 
totically  stabilize  them  about  their  respective 
equilibrium  points?  In  particular  we  hope  that  the 
plant  is  one  of  these  nearby  systems,  and  thus  we 
speak  of  robustness. 

In  our  work13  we  consider  a  transformable  non¬ 
linear  system  (and  to  make  the  present  discussion 
easier  we  assume  the  transformation  is  applicable 
on  all  of  Rn)  and  use  linear  feedback,  that  is,  we 
choose  vj >v2, . . .  ,vm  as  linear  feedback  controls, 
to  asymptotically  stabilize  the  linear  system  (3). 
The  corresponding  controls  ui,U2,..-,Ug  to  use  on 
the  mathematical  node!  and  the  plant  are  found  by 
solving  the  last  a  equations  in  (4) .  Given  any 
compact  set  K  C  Rn  ,x2, . . . .x^Jspace)  there  is 
an  open  neighborhood  of  the  system  (1)  (with 
ui,U2,...,un  substituted)  in  the  set  of  vector 
fields  in  r<f‘(K)  such  that  if  k€.-Y  then  a  solu¬ 
tion  of  x  •  k(x)  starting  at  any  point  xfK  must 
converge  to  the  equilibrium  point  of  x  •  k(x)  in  K. 
Here  VY  is  chosen  so  that  all  kt^Y  have  a  unique 
equilibrium  in  K.  Thus,  all  nearby  vector  fields 
are  also  stabilized;  the  proof  of  this  result 
appears  in  Ref.  13. 


Several  important  facts  are  used  in  developing 
this  theory.  One  is  that  if  linear  feedback  con¬ 
trols  v •  , v 2 _ ,vs  are  taker,  to  asymptotically 

stabilise  the  linear  canonical  system  (3) ,  then 
Lyapunov  functions  V(y)  can  be  constructed  in 
y-space.  However,  the  composition  of  a  V  with  a 
transformation  T  yields  a  Lyapunov  function  for 
the  nonlinear  system  (1)  with  controls 
U'.  ,U2, . . .  ,Ug  as  computed  from  our  equations. 

These  Lyapunov  functions  are  very  valuable  in 
assuring  us  of  the  above  mentioned  robuatness. 
Another  useful,  but  not  entirely  surprising,  dis¬ 
covery  is  that  the  eigenvalues  of  (3)  with 
v:,vj,...,va  substituted  are  the  same  as  the 
eigenvalues  of  the  linearization  of  (1)  about 
(0,0, . . ■ ,0)  and  with  the  corresponding 
u5,u:, - u*. 

V.  TRANSFORMATIONS ,  ESTIMATION,  AND. 

NONLINEAR  ZEROES 

Suppose  we  take  the  nonlinear  system  (1}  "sat¬ 
isfying  the  hypotheses  of  Theorem  1.  Me  add  an 
IR? -valued  output  t  ■  h(x),  which  is  \ f*.  In 
transforming  system  (1)  to  Brunovsky  canonical  form 
(3) ,  we  can  also  transform  the  output  to  obtain 
z  ■  h (*(>"))  •  h(x(T))  for  the  linear  system.  Me 
read  the  output  h(x)  of  the  nonlinear  system  and 
transform  to  the  output  for  the  linear  system.  If 
the  states  in  y-spacje  (where  the  dynamics  are 
linear)  can  be  estimated,  then  by  the  inverse 
transformation  we  have  estimates  of  the  states  in 
x-space. 

Bach1"  has  used  the  concept  of  transformation 
for  construction  of  an  estimator  of  aircraft 
notions ■ 

Continuing  to  considar  systems  with  outputs, 
we  turn  to  a  discussion  of  "nonlinear  zeroes,"  or, 
more  precisely,  nonlinear  zero  distributions. 

Given  a  linear  system  with  linear  output  there  are 
various  definitions  of  zeroes,  available.  We  refer 
the  interested  reader  to  Refs.  IS— 17. 

In  a  recent  paper,  Isidori  et  al.18  discuss 
nonlinear  decoupling  via  feedback  for  nonlinear 
systems-  They  introduce  the  concept  of  (f.g)  In¬ 
variant  distribution,  which  is  a  nonlinear  general¬ 
ization  of  the  (A,B)  invariant  subspace  of  Wonham 
and  Morse19  and  of  Basile  and  Marro.20  Krener  and 
Isidori21  give  the  definition  of  nonlinear  zero 
distribution,  analogous  to  the  ideas  of  zeroes  for 
linear  systems.  Two  cases  are  considered:  (1)  the 
number  of  inputs  •  i  <  ?  •  the  number  of  outputs 
and  (3)  *  »  p  (here  the  concept  of  (h,  f)  invariant 
distributions  must  be  introduced) .  Throughout  our 
discussion  we  examine  only  the  first  case. 


A  distribution  4  or.  Sn  is  an  assignment 
4(x)  of  a  lineal  subspacr  of  Rn  at  each  point  of 
We  assume  that  4  is  of  positive  constant 
dimension  anc  often  identifv  a  with  the  set  of 
vector  fields  in  it.  *e  also  take  4  to  be  invol- 
utive  and  regular  ^from  Ref.  18  this  means  that  the 
quotient  set  Rn/4  is  a  manifold) .  Tie  dis¬ 
tribution  4  is  (f.g)  invariant  if  there  exist 
a(x)  and  B(x)  so  that 

fg.iJCd 

where  f  •  f  *  ga  and  J  ■  g  S.  We  say  that  A  is 
a  null  observable  distribution  if  it  is  (f.g)  in¬ 
variant  ana  contained  in  the  kernel  of 
dh  •  (dhi  ,dh2t  -  ■ .  ,dh-j) .  We  suppose  that  a  maximal 
such  distribution  exists.  From  Ref.  21  we  deduce 
that  there  arc  initial  conditions  (not  Including 
points  where  f.gl,...,ga  are  linearly  dependent) 
in  each  level  set  of  the  output  (the  kernel  of  dh) 
and  controls  that  force  the  output  to  stay  in  the 
level  set  in  which  it  begins.  This  constant  output 
is  the  nonlinear  generalization  of  the  output  being 
zero  for  a  linear  system. 

How  does  the  above  concept  relate  to  the  trans¬ 
formation  theory  of  this  paper?  Suppose  our  system 
(1)  [or  (6)]  is  transformable  as  provided  in 
Theorem  1.  Does  there  exist  an  m-dimensional  Vm 
output  for  which  the  system  is  observable  for  every 
input,  but  for  which  there  is  not  a  null  observable 
distribution?  The  answer  is  yes  if  we  take 
!»l  •  Tj.hz  •  Since 

Tj  ,T0.  .l  ....  ,T0ai_i^l  satisfy  th*  equations  in  (4), 
no  nontrivial  (f.g)  invariant  distribution  can  be 
found  in  the  kernel  of  dh,  and  the  output  is  ob¬ 
servable  for  every  input.  This  follows  from  the 
method  in  Ref.  18  of  building  the  maximal  (f,g) 
invariant  distribution  contained  in  the  kernel  of 
dh  by  differentiating  the  output  repeatedly  with 
respect  to  time.  Let  us  examine  this  for  the  case 
a  •  1  and  h(x)  ■  Ti(x).  To  hold  the  output  Tj 
constant  we  must  have 

Tj  ■  constant 

T2-Tl-.(dT1,f>.u<dTl,gl)  -0 
t3  -  i,  -  <dT2.f>  *  u<dT2.g;> 

•  (dTj.f /  -  u<dT.,[f.g,]>  •  0 


n 


Our  system  is  (1)  which  we  now  write  as 

x  «  f(x)  ♦  g(x)u  (6) 

with  output  t  «  h(x)  .  Here  g(x)  is  the  nxa 

matrix  (gj.g, . 2d,}  and  u  the  column  vector 

(ui ,uj, . . . ,u«)  and  the  meaning  of  g(x)u  is 
obvious. 


<dT  .  .f :  t  u(dT,  ,(ad  •  *f.g,)>  •  0 
n-  i  i 


T:i- 1  *  ‘n  *  <dVf  '  *  u<dTn'*:' 


( dT  ,£)  i  u(dT.  ,  (ad  :  'f.g.')  »  3 
n  * 


i* 


The  equations  . <dT2,gi>  »  -<dTj , [f,gi]> , . . . , <dT ,gi> 
*  t<dT;,fad  ’"‘f.Ji))  follow  from  a  Leibniz 
formula  used  in  developing  Eqs.  (4)  ir.  Ref.  11. 

Now  <dTi , (adif ,gi)>  »•  0  for  j  »  0,1,2. ... ,<i-2 
by  Eqs.  (4)  and  <dT, ,  (ad,c!-*f.gl)>  P  0  from  (S). 
Hence  u  ■  0  and  from  (7) 

(dT],f>  »  <dTj,gj>»  0 
<dT.,f)  .  (dT  ,gl>-  0 


<dT„...f>  -  (dT^j.gj).  0 

Since  dTi ,dT2, . . • ,dTn  .  are  linearly  independent, 
the  initial  conditions  for  which  we  can  apply  0 
control  and  stay  ir.  a  level  set  of  the  output 
coincides  with  the  set  of  points  where  f  and  g 
are  linearly  dependent.  Thus,  no  (f,g)  invariant 
distribution  contained  in  the  kernel  of  dh  exists. 
Also  notice  that  since  u  does  not  appear  in 

?1,T2 . Tn  and  dTj ,dT2i . . . ,dTn  are  linearly 

independent,  this  system  is  observable  for  every 
input. 

The  main  purpose  of  the  above  discussion  is 
to  show  the  relationship  between  the  equations  for 
dTi,dTo,„i....,dTaTO  in  (4)  and  the  method  of 

comouting  null  observable  distributions  in  Refs. 

18  and  21. 

VI.  CONCLUSION 

We  have  presented  a  sumary  of  the  theory  of 
controller  design  via  transformations  of  nonlinear 
systems  to  linear  systems.  The  related  topics  of 
the  construction  of  transformations,  robustness, 
estimation,  and  "nonlinear  zeroes"  have  also  been 
treated.  As  mentioned,  a  paper  discussing  the 
application  of  our  theory  is  contained  elsewhere 
in  these  conference  proceedings. 

REFERENCES 

1.  P.  Brunovsky,  A  Classification  of  Linear 
Controllable  System,  Kibemetika  (Praha),  vol.  6, 
197C,  pp.  173-187. 

2.  A.  J.  Krener,  On  the  Equivalence  of  Con¬ 
trol  Systems  and  the  Linearization  of  Nonlinear 
Systems,  SIAM  J.  Control,  vol.  11,  1973,  pp.  670-676. 

3.  R.  W.  Brockett,  Feedback  Invariants  for 
Nonlinear  Systems,  IFAC  Congress,  Helsinki,  1978. 

4.  B.  Jakubczyk  and  W.  Respondek,  On  Lineari¬ 
zation  of  Control  Systems,"  Bull.  Acad.  Pol.  Sci.. 
Ser.  Sci.  Math.  Astronoa.  Phvs.,  vol.  28.  1980, 

pp.  S17-S22. 

5.  R.  Hermann,  The  Theory  of  Equivalence  of 
Pfaffian  Systems  and  Input  Systems  under  Feedback, 
preprint. 

6.  G.  Meyer  and  l.  Cicolani.  A  Formal  Struc¬ 
ture  for  Advanced  Automatic  Flight  Control  Systems, 
NASA  TNO-7940,  197'. 


7.  G.  Meyer  and  L.  Cicoiani,  Applications  of 
Nonlinear  System  Inverses  to  Automatic  Flight  Con¬ 
trol  Design  -  System  Concepts  and  Flight  Evalua¬ 
tions,  AGARDograph  on  Theory  and  Application  of 
Optimal  Control  in  Aerospace  Systems,  P.  Kant,  ed., 
1980. 

8.  G.  Meyer,  The  Design  of  Exact  Nonlinear 
Model  Followers,  1981  Joint  Automatic  Control 
Conference,  FA-3A. 

9.  R.  Su,  On  the  Linear  Equivalents  of  Non¬ 
linear  Systems,  Systems  and  Control  Letters,  to 
appear. 

10.  L.  R.  Hunt,  R.  Su,  and  G.  Meyer,  Global' 
Transformations  of  Nonlinear  Systems,  IEEE  Trans. 
Autom.  Ccntr. ,  to  appear. 

11.  L.  R.  Hunt,  R.  Su,  and  G.  Meyer,  Design 
for  Multi-Input  Nonlinear  Systems,  Conference  on 
Differential  Geometric  Control  Theory,  to  appear. 

12.  L.  R.  Hunt  and  R.  Su,  Control  of  Non¬ 
linear  Time-Varying  Systems,  20th  IEEE  Conference 
on  Decision  and  Control,  San  Diego,  CA,  1981, 

pp.  5S8-563. 

13.  R.  Su,  G.  Meyer,  and  L.  R.  Hunt,  Robust¬ 
ness  in  Nonlinear  Control,  Conference  on  Differ¬ 
ential  Geometric  Control  Theory,  to  appear. 

14.  R.  E.  Bach,  Jr.,  A  Mathematical  Model  for 
Efficient  Estimation  of  Aircraft  Motions,  6th  IFAC 
Symposium  on  Identification  and  System  Parameter 
Estimation,  to  appear. 

15.  A.G.J.  MacFarlane  and  N.  Karcanias,  Poles 

and  Zeroes  of  Linear  Multivariable  System:  A 
Survey  of  the  Algebraic,  Geometric,  and  Complex 
Variable  Theory,  Int.  J.  Contr. ,  vol.  24,  1976, 
pp.  33-74.  “ 

16.  C.  A.  Desoer  and  J.  D.  Schulman,  Zeroes 
and  Poles  of  Matrix  Transfer  Functions  and  Their 
Dynamical  Interpretation,  IEEE  Trans.  Circuits 
Systems .  vol.  21,  1974,  pp.  3-8. 

17.  B.  F.  Wyman  and  M.  K.  Sain,  The  Zero 
Module  and  Essential  Inverse  Systems,  IEEE  Trans. 
Circuits  Systems,  vol.  28,  1981,  pp.  112-126. 

18.  A.  Isidori,  A.  J.  Krener,  C.  Gori-Giorgi, 
and  S.  Monaco,  Nonlinear  Decoupling  Via  Feedback: 

A  Differential  Geometric  Approach,  IEEE  Trans. 

Autom.  Contr..  vol. 26,  1981,  pp.  331-34S. 

19.  W.  M.  Wonham  and  A.  S.  Morse,  Decoupling 
and  Pole  Assignment  in  Linear  Multivariable  Systems: 
A  Geometric  Approach,  SIAM  J.  Control ,  vol.  8,  1970, 
pp.  1-18. 

20.  G.  Basile  and  G.  Marro,  Controlled  and 
Conditioned  Invariant  Subspaces  in  Linear  System 
Theory,  J.  Optimization  Theory  Apol..  vol  3,  1969, 
pp.  306-315. 

Cl.  A.  J.  Krener  and  A.  Isidori,  Nonlinear 
C  ro  Distributions,  1 9th  IEEE  Conference  on  Decision 
and  Control,  Albuquerque,  N.M.,  1980,  pp.  665-668. 


39 


o 


APPLICATION’S  TO  AERONAUTICS  OF  THE  THEORY  OF  TRANSFORMATIONS  OF  NONLINEAR  SYSTEMS 


George  Meyer,*  Renjeng  Su,**  and  L.  R.  Hunt* 

NASA  Ames  Research  Center,  Moffett  Field,  California,  U.S.A. 


ABSTRACT 

We  discuss  the  development  of  a  theory,  its  application  to  the  control  design  of  nonlinear  sys¬ 
tems,  and  results  concerning  the  use  of  this  design  technique  for  automatic  flight  control  of  air¬ 
craft.  The  theory  examines  the  transformation  of  nonlinear  systems  to  linear  systems.  We  show  how 
to  apply  this  in  practice,  in  particular,  the  tracking  of  linear  models  by  nonlinear  plants.  Results 
of  manned  simulation  are  also  presented. 


INTRODUCTION 

Suppose  we  model  a  physical  plant  by  a  nonlinear  system 


x(t) 


f(x(t>]  +  y*]  ui(c;g1  [x(t>  ] 
i-1 


(1) 


where  f.  gt  ,  •••.  gJn  are  C*  vector  fields  on  1Rn  and  f(0)  *  0.  If  we  are  to  have  the  output  of 
this  plant  follow  a  particular  path,  then  we  have  a  difficult  problem  to  consider.  However,  if  there 
are  new  state  space  coordinates  and  new  controls  under  which  equation  (1)  becomes  a  linear  system, 
then  our  task  appears  to  be  much  easier  because  of  the  known  results  for  controller  design  on  linear 
systems . 


We  feel  that  the  following  problems  are  thus  of  interest: 

(a)  Find  necessary  and  sufficient  conditions  for  the  system  (1)  to  be  transformable  to 
a  controllable  linear  systems. 

(b)  Show  how  to  use  these  transformations  so  that  the  controller  design  for  nonlinear  systems 
can  be  reduced  to  that  of  linear  systems. 

(c)  Apply  the  above  theory  to  the  field  of  aeronautics. 

In  the  next  three  sections  of  this  paper  we  discuss  the  solutions  of  these  problems. 


c 


t  ! 


TRANSFORMATION  THEORY 

The  classification  of  those  nonlinear  systems  that  can  be  transformed  to  linear  systems  is  actu¬ 
ally  a  subproblem  of  a  much  deeper  result,  the  construction  of  canonical  forms  for  nonlinear  systems. 
We  are  presently  developing  a  theory  for  such  canonical  forms,  and  in  the  case  that  a  nonlinear  sys¬ 
tem  is  transformable  as  in  this  paper,  the  canonical  form  is  actually  the  Brunovsky  (ref.  1)  form  for 
a  linear  system. 

Here  we  concentrate  on  the  transformation  theory  developed  ir  references  2,  3  and  4.  Other 
significant  research  in  this  area  is  due  to  Krener  (ref.  5),  Brockett  (rei.  6),  Jakubcytk  and 
Respondek  (ref.  7),  and  Hermann  (The  Theory  of  Equivalence  of  Pfaffian  Systems  and  Input  Systems 
under  Feedback).  We  also  refer  to  the  early  work  of  the  first  author  in  references  S  and  9. 

If  we  are  to  map  our  nonlinear  system  (1)  to  a  controllable  linear  system,  we  may  as 
well  assume  that  this  linear  system  is  in  Brunovksv  (ref.  1)  canonical  form  with  Kronecker  indices 

ra 

«,  ,  ....  <_  satisfying  Fk  ■  n  and  k,  >  <>...><  .  Hence  this  svstem  is 

i  i  m  i  i  -  i  —  ~  a 

y  »  Av  +  Bw  ,  (2) 

where  A  is  n  *  n,  B  is  n  *  m,  w  «  (w,  ,w  ,  . . . ,  w_)  are  the  new  controls,  A  is  equal  to 
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Th«  transformation  results  vs  present  are  actually  local  (in  some  oper.  neighborhood  of  the 
origin  in  (xj,  x2,  ....  x„)  space),  and  global  theorem*  are  found  In  reference  3.  We  simplify  note 
tion  by  saying  |»b  when  we  actually  mean  an  open  neighborhood  of  (0,  0,  ....  0)  in  JRn. 

K*  means  (uj,  u2.  ....  u*)  space  (or  (wj,  wj,  ....  w_)  space)  and  this  is  not  local. 


However, 
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We  discuss  the  allowable  transformations  napping  system  (1)  to  system  (2).  We  want  a  CT  nap 
Y  ■  (yi»  yj.  ....  yn>  V|,  v2,  •  ...  w*)  napping  Fn  *  F“C (Xj  »  Xj  »  ...»  Xyj  •  Uj  »  Uj  •  •••  *  u„)  space] 
co  Pn  *  F°  [(>-,,  y2 . y„,  Hj ,  w2,  ....  wm)  space]  that  satisfies  the  following  conditions: 

1.  y  naps  the  origin  to  the  origin, 

2.  yj.  y2,  . ...  yn  are  functions  of  xj,  x2 ,  ...»  Xj,  only  and  have  a  nonsingular  Jacobian 
matrix, 

3.  wlt  w2,  ....  wB  are  functions  of  xj ,  x2,  ....  x„,  uj,  u2,  ....  uB  and  for  fixed  Xj,  x2 

. ...  x,,,  the  b  x  a  Jacobian  matrix  of  wx,  v2,  wD  with  respect  to  uj ,  U2,  ..  • ,  Ug 

is  nonsingular, 

4.  y  saps  system  (1)  co  sysces  (2), 

5.  y  is  a  one-to-one  map  of  FB  x  F®  onto  RB  *  F®. 

Next  we  Introduce  soae  basic  definitions  from  differential  geometry. 

If  f  and  g  are  c”  vector  fields  on  Rn ,  the  Lie  bracket  of  f  and  g  is 

tf*«3  *  £  •  ta  * 

3g  df 

where  and  -jj  are  Jacobian  matrices.  Vie  let 

(ad°f,g)  •  g 
(ad'f.g)  -  [f,g] 

(ad2f ,g)  -  (f,(f,g)] 

(adkf,g>  -  tf(adk_1f,g)]  . 

A  collection  of  C*  vector  fields  hj,  h2 . hr  is  flivolutlve  if  there  exists  C"  func¬ 
tions  such  that 


r 

(h^h^Hx)  Y1Jk(x)hk(x)  ,  1  <  i,  J  <  r,  i  *  J 


Let  (...)  denote  the  duality  between  one  forms  and  vector  fields.  If  u  »  uj  dx ,  +  u2dx2  + 
...  -t-  wndxn  is  a  differentiable  one  fora  and  f  a  vector  laid  on  Fn,  then 

<w,f>  "  «ifj  +  «?f2  *  ...  +  “n^n 

To  state  the  main  result  from  reference  4  giving  necessary  and  sufficient  conditions  for  trans¬ 
forming  system  (1)  to  system  (2)  we  need  the  following  sets: 

C  •  |tl.[f,gi],...,(adICl‘1f,gi),gj,[f,g2],...,(ad,C2”1f,g2),..., 

. 

Cj  •  |^t^.[f,gl]....,(■d,C3’^f,^l).$^,!f,g2]..•.»(•<J,'^  *  f » 82^  t  •  •  • » 

If ,8B1 • ■ • > for  j»l,2,...,m 

Theorem  2.1  There  exists  a  transformation  Y  »  (yj,  y2,  ....  yn,  wj,  w2,  WB)  satisfying  condi¬ 

tions  1)  through  v)  above  if  and  only  if  on  Fn 

1)  the  set  C  spans  an  n  dimensional  space, 

2)  each  set  Cj  is  involutive  for  j  «  l,2,...,m,  and, 

3)  the  span  of  equals  the  span  of  Cj  ^  C  for  j  -  l,2,...,m. 

Let  oj  «  <j,  o2  “  +  <2,  ....  o  «  k j  +  k2  +  . , .  +  «  n.  Then  the  transformation  is  con¬ 

structed  in  reference  4  by  solving  the  partial  differential  equations 


<dvj , (ad^f  ,g^))  -  0, j*0, 1 . x j-2  and  i«l,2 . . 

<<Jy  +  .(ad^f.'g.))  -  O.J-O.l . X2-2  *nd  i“l,2,...,m. 


\r>.  \  s.* 


. V*  -V. 


rvrv: 


<  dy, 


y.  +  lt(adJf,g,)>  •  O,j-0,l,...,<  -2  and  1-1, 2,..., s. 
cn-l  *  * 


(3) 


<d3V°  *£  “i<dyffj'*i>  * 
i-l 

n 

<dy(J2*f>+2ui<dy«2,gi>  ’”2 


i-i 


<dVf>  +2  *i<dyn**i>  ’ 


where  the  aacrlx 


1-1 


(dyi.Cad'^f.gi)) 

<dyo1+l*(ad*l‘,f*«»>> 


(dyi.Ud'^f.g,)) 

<*9t*v<**rl*'*ii 


(O 


is  nonsingular. 

ea 

Ic  can  be  shewn  chat  aacrlx  (4)  being  invertible  naans  we  ten  solve  for  uJt  u2,  ...,  Ug 
la  terns  of  vi,  v2 ,  ...,  wa  la  the  lest  a  equations  in  equation  (3). 

equation  (3)  can  be  foraally  solved  by  consider lag  a  aaquence  of  ordinary  differencial  equations 
as  la  re far once  4,  but  we  shall  not  neat ion  details  here. 

Zf  a  nonlinear  systaa  Is  transforaable  to  e  linear  systea,  wa  study  the  process  of  using  the 
eraasforaatlsn  to  construct  a  controller  for  the  nonlinear  systea. 


TRAKSTORMATZORS  ZR  CONTROLLER  DESIGN 

Lac  T  -  (y,.  y2 .  ....  y„,  wj,  w2,  ....  wa)  be  the  transforaaclon  froa  systea  (1)  to  systea  (2) 
as  before.  The  structure  of  the  control  systaa  using  erans ferns don  theory  la  illustrated  in 
figure  1.  The  design  scheae  Is  lapleaeaced  on  the  "linear  part"  of  the  diagraa,  and  this  systaa 
is  la  Brunovsky  fora. 

Ho  ask  that  the  output  of  the  nonlinear  systea  follow  a  particular  path  which  corresponds  to  a 
trajectory  for  the  output  of  the  linear  nodal.  Zf  we  know  bow  to  design  for  the  linear  systea,  then 
we  actually  have  a  cracking  of  a  linear  aodel  by  a  nonlinear  plant. 

Linear  deelgn  is  used  to  generate  an  open  loop  coaaand  we,  for  the  systea  (2),  and  we  find 
the  corresponding  y  coordinates  ye  by  plugging  we  into  equation  (2).  The  transforaaclon  T 
aapa  the  aeasured  x  space  to  y  space  and  y  la  coaparcd  to  ye  and  the  difference  la  an 
error  Oy.  The  regulator  yields  a  control  4w  which  sends  a_  to  zero,  and  variations  In  plant 
dynaalcs  and  disturbances  are  ceapensated  for  in  this  way.  7 

The  controls  ve  and  4w  are  added  and  erans foraed  through  the  Inverse  map  R  (actually 
ve  -  4v  is  substituted  into  the  last  a  equations  la  equation  (3)  and  u  ■  (ult  u2,  ....  ug)  is 
generated)  to  obtain  a  control  which  la  applied  to  the  plant.  Thus  we  have  an  exact  aodel  follower, 
and  the  difficult  problea  of  finding  an  open  loop  control  and  the  regulator  control  are  constrained  to 
the  linear  systea. 

The  reaalnder  of  this  paper  contains  the  application  of  the  cransfornation  theory  to  aeronautics. 


AUTOMATIC  FLICHT  CONTROLLER  DESIGN 

The  aircraft  will  be  represented  by  a  rigid  body  aoving  In  3-dlaensional  spaee  in  response  to 
gravity,  eerodynaalcs  and  propulsion. 


9 4 


Th«  state 


G) 


e  XC  R3  *  R3  *  S0(3)  *  R3 


(5) 


where  r  And  v  arc  inertial  coordinAtas  of  body  center  of  baas  position  and  velocity,  ratpactlvely; 
C  la  tha  direction  cosine  matrix  of  body  fixed  axes  relative  to  the  runway  fixed  axes  (inertial), 
and  u  Is  the  angular  velocity. 


The  controls. 


■*© 


t  0  C  R*  *  F 


(6) 


where 


is  the  3-axis 


ent  control  such  as  ailerons,  elevator  and  rudder  in  a  conventional  air¬ 


craft  or  roll  cyclic,  pitch  cyclic  and  tall  rotor  collective  in  a  helicopter;  and 


controls 


power  -  throttle  in  a  conventional  aircraft,  and  tha  mein  rotor  collective  in  a  helicopter.  The 
state  equation  consists  of  the  translational  and  rotational  kinematic  and  dynamic  equations: 

f  -  v 

v  ■  fr(x,u) 
i  -  SMC 
&  -  f^x.u) 

where  fF  and  f*  are  the  total  force  and  moment  generation  proeeaaes  and  xcX. 
equation  (7)  into  a  linear  syatem. 

In  general,  fM  la  invertible  with  respect  to  the  (vector)  pair  (w.ir*),  and,  for  the  specific 


(7) 


Ue  wish  to  transform 


class  of 
to  tha 


then 


f  helicopter  maneuvers  being  considered  (i.e. ,  no  360*  rolls) ,  f *  la  invertible  with  respect 
(scalar)  pair  (Vj.u*).  Thus,  a  function  h:X  «  K*  ■*  0  can  be  constructed  such  that  If 


G) 


h(r,v,C,w,wg,  vjg) 


(•) 


u  •  Mg 
v*  -  vjg 


(9) 


for  all  admissible  maneuvers.  That  la,  angular  and  vertical  accelerations  can  be  chosen  as  the  new 
set  of  Independent  controls  in  which  case  the  state  equation  may  be  written  as  follows 


r  ■  v 

v  »  f°(r ,v,C,vjg)  +  cf 1 (r,v,C,vjg,u,wo) 
C  *  s(m)C 


(10) 


.  M  “  ug 

where  c  *  1,  and  fl(r,v,C, 0,0,0)  *  0 


for  all  admissible  maneuvers. 


The  function  f°  is  invertible  with  respect  to  ((vj  ,v2  ,E3(*))  ,C)  where  Ej(e)  is  an  elementary 
rotation  about  the  z-axls  and  represents  the  heading  of  the  helicopter.  Thus,  a  function  h*:F* 

«  S0(2)  •*  S0(3)  can  be  constructed  such  that  if 


Co  •  h* (r,v,$c,Ej«io)) 


then 


v0 


(11) 


(12) 


Equations  (8)  and  (11)  are  the  trim  equations  of  t.-.e  procsss  equation  (10)  (with  c  ■  0).  That  is, 
for  a  given  path  (r(t),  Ej(*(t))),  t  1  0  with  'vj(t)  -  0,  the  corresponding  state  and  control  may  be 
constructed  as  follows 
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vc  •  r(t) 

Co  •  of<ro.vu,v(c).E|,(v(t)) 

.  .  (13) 

wo  -  q(CC*) 

MQ  ■  <M(|) 

«c  • 

wten  the  fraction  q  extract*  a  froa  CCc  -  S(m)  .  The  required  eiaa  darlvativea  la  aquation  (13) 
con  bo  ceuMtod  pwlW  that  tho  pack  (r,*j)  la  generated  bp  ehc  systea  diagramed  la  figure  2 
when  a  rep  recent*,  a  •  color  latagrater  and  jj  eba  control  (w  in  the  previous  section  la  y*  here) . 

We  ceaotnct  an  ayproxlaotlen  to  the  linearizing  cranaforaatlen  aa  followa:  Tlt  »i,  Q  are  con¬ 
structed  ae  that 

. 

y  •  T(a)  *  Yo(xq)  +  Yjda  •  To  Yjdx 

.  .  (1*> 
u  •  *(«,ys)  ■  ug  ♦  Xjdp*  ♦  Qdx 

■ere  To  la  the  tranafornatlon  when  c  •  0,  aad  da  (and  4ys)  la  the  perturbation  about  the  nonlaal 
*o  (and  yj)  glean  la  equation  (13)  (and  figure  2). 

e 

froa  equation  (10)  with  c  -  0,  It  follows  that  (C  •  (X  ♦  t(c)Cg) 


('»)■  •%’,r  +  hr*,+TBT‘+ISr*1 


where  c  la  attitude  perturbation. 

The  pattern  of  equation  (13)  after 


t  of  eoordlnatea  la  shown  In  equation  (16). 


Zn  eh*  present  case  of  eh*  helicopter,  C],  C. ,  and  C5  are  negligible.  Their  effect  will  be  con¬ 
trolled  by  the  regulator. 


,v.q 


Th«  transformation* 


(17) 
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DESIGN  FOR  MULTI- INPUT  NONLINEAR  SYSTEMS 
L.  R.  Hunt*,  Renjeng  Su#*,  and  G.  Meyer* 

Abstract:  Consider  the  multi -Input  nonlinear  system 

m 

x(t)  •  f(x(t))  +  S  u-j (t)g^ (x(t) ) ., 

1«i 

m 

where  f,g.,  .  .  gm  are  V  vector  fields  on  some  neighborhood  of 
the  origin  In  Fn  and  f{0)-*  0.  We  present  necessary  and  sufficient 
-conditions  for  this  system  to  be  transformed  to  a  controllable  linear 
system.  Our  results  are  constructive  and  depend  upon  the  solutions  of 
overdetermined  systems  of  partial  differential  equations.  Moreover, 
we  Indicate  how  this  theory  Is  applied  to  build  an  automatic  flight 
controller  for  vertical  and  short  takeoff  (VSTOL)  aircraft.  Flight- 
test  simulation  results  are  presented. 

1.  Introduction.  We  examine  nonlinear  systems  of  the  type 

m 

x(t)  ■  f(x(t))  +  S  «l(t)gi(x(t))  ,  x  ■  ^|  ,  (1) 

m 

where  ftgl#  .  .  .,  gm  are  V  vector  fields  defined  on  some  open 
neighborhood  of  the  origin  In  Rn  and  f (0 )  ■  0.  Our  goal  Is  to  find 
necessary  and  sufficient  conditions  on  (1)  which  guarantee  that  this 
system  can  be  transformed  to  a  controllable  linear  system,  again  for 
(x^Xj,  .  .  .,  xn)  In  some  neighborhood  of  the  origin  in  Rn.  Of 
course,  we  want  our  transformations  to  map  the  origin  to  the  origin, 
to  have  a  nonsingular  Jacobian  matrix,  and  to  be  one-to-one.  For  the 
purpose  of  applications.  It  Is  convenient  to  have  a  method  to  construct 
a  transformation,  rather  than  just  conditions  under  which  a  transforma¬ 
tion  exists.  Therefore,  such  a  method  is  Introduced. 

Meyer  and  Clcolanl  In  [8]  and  [9]  have  considered  such  transforma¬ 
tions  for  block  triangular. systems.  Krener  [7]  found  conditions  if  the 
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transformations  are  restricted  to  state  space  coordinate  changes.  In 
the  single-input  case,  Brockett  [1]  presented  assumptions  under  which  a 
nonlinear  system  and  a  controllable  linear  system  are  equivalent  if 
coordinate  changes  and  additive  state  feedback  are  used.  With  the 
addition  of  input  space  coordinate  chances  for  a  multi-input  system, 
Jakubczyk  and  Respondek  [5]  discovered  necessary  and  sufficient 
conditions  for  a  nonlinear  system  to  be  equivalent  to  a  controllable 
linear  system. 

Recently,  the  second  author  [13]  proved  a  result  for  transforming 
a  single  input  nonlinear  system  (m  *  1  and  g1  ■  g)  to  a  linear  system, 
where  the  conditions  under  which  transformations  exist  are  more  general 
than  those  mentioned  above.  Essentially,  the  system  (1)  is  "equivalent 
(In  a  neighborhood  of  the  origin)"  to  a  linear  system  in  integrator 

form  if  and  only  If  g,[f ,g],(ad2f,g) . (edn“l  f,g)  are  linearly 

independent  and  g,[f,g],(ad2f,g),  .  .  .,  (adn~2  f,g)  are  involutive 
in  some  open  neighborhood  of  the  origin  in  ft1.  Here  [*,’]  denotes 
the  Lie  bracket,  (ad2  f,g)  *  [f,[f,g]],  etc. 

The  purpose  of  the  paper  Is  to  generalize  the  theory  In  [13]  to 
multi-input  systems.  First,  we  establish  a  notion  of  equivalence 
between  systems  and  determine  the  types  of  transformations  of  Interest 
to  us.  In  fact,  we  derive  a  system  of  partial  differential  equations, 
the  solution  of  which  gives  us  a  transformation.  We  remark  that  our 
sufficient  conditions  for  the  existence  of  a  transformation  are  weaker 
than  those  found  in  [5].  Host  importantly.  In  addition  to  showing  the 
existence  of  a  transformation,  we  actually  present  a  constructive  proof. 

Since  we  are  mapping  to  controllable  linear  systems  and  each  such 
linear  system  has  a  Brunovsky  [2]  canonical  form  based  on  its  Kronecker 
Indices  (see  [6]  or  [16]),  we  may  as  well  choose  a  set  of  Kronecker 
Indices  and  a  canonical  form  and  ask  for  necessary  and  sufficient  con¬ 
ditions  to  transform  our  system  (1)  to  this  canonical  form.  Suppose 
we  take  positive  Integers  (the  Kronecker  indices)  <1,k2,  .  .  .,  <m 
with  +  ie2  +  .  .  .  +  iem  ■  n  and  3  <2  -  •  •  •  -  <m  and  the 

Brunovsky  form  associated  with  these  indices.  Assume  that  g1,g2»  •  •  •. 
g^,  are  linearly  independent  near  the  origin  (this  can  be  weakened). 
With  a  possible  reordering  of  g,,g2,  •  •  ••  9m«  can  transform  our 
nonlinear  system  to  the  linear  system  in  this  Brunovsky  form  if  and 
only  if  in  some  neighborhood  of  the  origin  in  Rn: 


•  I 


i)  The  set  C  *  (g^Cf.g^,  .  .  {ad*1_1f ig,)  ,g2,[f ,g,],  .  . 

(ad*2  f,g2),  .  .  gm.[f.gm3 . (ad*m  lf»9m)>  spans 

an  n-dimensional  space, 

ic  . *2 

iij  The  sets  Cj  «  (gj.Cf .g13.  •  .  ..  (ad  J  f,g1),g2,[f,g2],  .  . 
(adKj  f,g2),  .  .  .,gm,Cf.gm],  ....  (ad*J  2f,gm)}  are 
involutive  for  j  «  1,2,  .  .  . ,  m,  and 

111)  The  span  of  each  Cj  is  equal  to  the  span  of  Cj  n  c. 

We  obtain  a  transformation  T:Rn4ffl  JR04™  where 

(xlfx2,  .  .  .,  xn,u1,u2,  .  .  . ,  um)  are  the  variables  in  the  domain  and 

(TjiTj*  ....  Tn4ffl)  are  the  variables  in  the  range.  Of  course 

Xj,x2,  .  .  . ,  xn  are  the  state  space  variables  and  ultu2 .  um 

are  the  controls  for  our  nonlinear  system  (1).  Likewise  Tj.Tj,  .... 

Tn  are  the  state  variables  and  T_ . ,  ,T  . . T  are  the  controls 

11  n+i  n+z,  n-nn 

for  our  linear  system  In  canonical  form.  The  existence  of  such  a 

mapping  follows  from  the  Frobenius  Theorem,  and  we  give  a  constructive 

proof  of  the  transformation. 

In  [5]  the  additional  assumptions  made  for  the  transformation 
problem  are  that  each  of  the  sets 

1  Kj-i  K.-i 

Cj  ■  (gj.CftgjJ.  •  •  •»  (ad  f.gx).g2»[f»923.  •  •  •  »  (ad  f.g2). 

x.-i 

•  •  •»  9ni*[f .9nj3 »  •  •  •»  (ad  f>9n|)} 

are  involutive  for  j  *  1,2,  .  .  .,  m  and  i  *  3,4,  .  .  .,  <j. 

However,  It  is  quite  surprising  that  we  can  show  that  these  extra  con¬ 
ditions  are  a  consequence  of  our  assumptions  i),  i 1 ) ,  and  iii)  above. 

This  fact  is  apparent  in  the  approach  we  take  through  the  topic  of 
partial  differential  equations  and  indicates  the  power  of  our  technique. 

At  NASA  Ames  Research  Center,  the  theory  provides  a  formal 
mathematical  framework  for  our  design  technique  involving  an  automatic 
flight  controller  for  VSTOL  aircraft.  The  highly  nonlinear  mathematical 
models  are  transformed  to  controllable  linear  systems,  where  known 
techniques  can  be  applied  to  build  a  controller.  The  partial  differ¬ 
ential  equations  approach  as  presented  in  this  paper  is  the  basis  for 
the  construction  of  our  transformations.  In  many  cases,  transforma¬ 
tions  can  be  built  and  researchers  are  now  working  on  methods  to  find 
transformations  by  the  symbolic  computation  of  the  MIT  MACSYMA  program 
and  by  numerical  methods.  We  also^have  a  technique  for  providing 


transformations,  which  are  computed  on-line  along  the  system  trajectory 
[10].  Furthermore,  we  are  able  to  build  a  transformation  by  first 
constructing  the  exact  mapping  on  a  lower  dimensional  submanifold  of 
Sn  and  then  extrapolating  off  this  submanifold  using  the  partial 
differential  equations. 

An  application  of  transformation  theory  to  the  design  of  exact 
nonlinear  model  followers  Is  given  in  Section  2.  In  Section  3  of  this 
paper  we  present  definitions  and  preliminaries.  Section  4  contains  our 
definitions  of  equivalence  for  systems  and  the  partial  differential 
equations  we  must  solve  to  find  a  transformation.  In  Section  5  we  give 
a  constructive  proof  of  our  result  Involving  transformations  and  present 
an  example.  Also  we  exhibit,  a  method  of  choosing  the  appropriate 
Kronecicer  indices  given  the  nonlinear  system  (1).  An  Appendix  contains 
the  results  of  flight  tests  for  automatic  control  of  the  UH-lH 
hel i copter. 

2.  Applications. 

9efore  we  begin  our  discussion  of  conditions  under  which  trans¬ 
formations  from  nonlinear  to  linear  systems  exist  and  how  such 
transformations  can  be  constructed,  we  first  present  an  application 
of  the  theory. 

Suppose  we  have  a  nonsingular  transformation  T  mapping  our 
system  (1)  to  a  time-invariant  and  controllable  linear  system  with 
state  variable  y  *  (ylfyz,  .  .  . ,  y n)  and  control  variable 
v  »  lv!»v2*  •  •  vm).  We  control  the  plant  (the  nonlinear  system) 
by  controlling  the  linear  system.  The  proposed  structure  of  the  com¬ 
plete  control  system  is  specified  in  figure  I.  Note  that  the  design 
is  carried  out  on  the  "linear  side"  of  the  transformation,  and  in  our 
theory  this  linear  system  Is  In  Brunovs  ky  form. 

Suppose  we  wish  the  output  of  the  linear  system  to  perform  a 

A  0 

particular  task  corresponding  to  a  similar  task  for  the  nonlinear 
system  in  x  space.  Linear  design  is  used  to  find  an  open-loop 
command  vQ  ,  and  we  obtain  the  corresponding  y*  coordinates  by 
substituting  v.  into  the  linear  system.  The  transformation  T  maps 
from  x  space  to  y  space  and  y  is  compared  to  y0  to  yield  an 
error  ey.  The  regulator  stabilizes  out  the  difference  using  a  control 
;v  ;  disturbances  and  variations  in  plant  dynamics  are  handled  in  this 
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way.  The  controls  vQ  and  5v  are  added  and  transformed  through  the 
inverse  map  T"1  to  give  a  control  u  ■  (ultu2,  .  .  . ,  1%)  which  is 
applied  to  the  plant.  By  this  we  generate  an  exact  model  follower. 

The  difficult  tasks  of  finding  the  open-1oop~control  and  the  regulated 
control  are  performed  on  the  linear  system  that  make  an  easier  design 
possible. 

The  aerodynamic  characteristics  and  operational  requirements  of 
modern  aircraft  present  the  control  system  designer  with  requirements 
that  are  increasingly  difficult  to  solve  using  the  standard  control 
system  design  methods.  There  can  be  strong,  multi axis,  highly  coupled 
nonlinearities,  and  anticipated  operational  capabilities  require  that 
the  aircraft  be  precisely  controlled  over  a  substantial  portion  of 
the  flight  envelope  that  encompasses  the  range  of  nonlinear  aerodynamics 
Thus,  the  nonlinearity  is  an  essential  part  of  the  design  problem. 

The  design  approach  discussed  in  this  paper  and  in  reference  [10] 
has  been  applied  to  several  aircraft  of  increasing  complexity.  The 
completely  automatic  flight-control  system  was  first  tested  in  flight 
on  a  DHC-6.  This  test  required  the  aircraft  to  operate  over  a  major 
part  of  the  flight  envelope,  and  the  system  performed  well  despite 
disturbances  and  plant  variations  (see  [14]).  Next,  the  approach  was 
applied  to  the  Augmentor  Wing  Jet  STOL  Research  aircraft  and  success¬ 
fully  flight  tested  [9].  Pilot  Inputs  were  incorporated  into  the 
design  method  in  [15].  The  approach  was  also  applied  to  control  an 
A-7  for  carrier  landing  and  tested  in  manned  simulation  (see  [11],  [12]). 

The  design  method  is  currently  being  used  for  the  UH-1H  helicopter. 
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3.  Prel iminaries . 

We  now  give  basic  definitions  and  results  from  differential 
geometry  and  from  the  theory  of  controllable  linear  systems. 

For  flT  vector  fields  f  and  g  on  Rn  we  define  the  Lie  bracket 
of  f  and  g 

[f,g]  »  la  f  .  11  g 
L  ,9J  3x  3x  9 

where  3g/3x  and  3f/3x  are  Jacobian  matrices.  Ue  can  introduce 
successive  Lie  brackets  [f,[f,g]],  [g,[f.g]].  etc.,  and  define 

(ad°f,g)  «  g 
(ad1f,g)  «  [f,g] 

(ad2f,gj  •  [f,[f,g]] 


(adkf,g)  «  C^.(*dk"2f.g>]. 

A  set  of  &  vector  fields  {fltf2,  .  .  . ,  fr>  on  Rn  is 
Invclutlve  If  there  exist  functions  ''f-f  jk(x)  such  that 

CfltfjJ(x)  *  Z  r1jit(x)fk(x)t  1  s  1,  J  <  r,  i  j»  j  . 

If  fj.fj*  .  .  ..  fp  Is  an  Involutive  collection  of  linearly 
Independent  vector  fields  and  xQ  €  Rn,  then  by  the  Frobenius  Theorem 
there  exists  a  unique  r  dimensional  V  submanifold  S  of  Rn 
through  xQ  with  the  tangent  space  to  S  at  each  x  €  S  being  the 
space  spanned  by  f1{x),f2(x),  ....  fP(x).  We  say  that  S  is  the 
unique  integral  manifold  of  fltf2,  .  .  . ,  f r  through  xQ. 

m  m 

Suppose  f  Is  a  'f  vector  field  on  Rn  and  h  is  a 
function  with  gradient  dh.  The  Lie  derivative  of  h  with  respect 
to _ f  is 

Lf(h)  «  <dh,f >  . 

where  <•,•>  denotes  the  duality  between  one  forms  and  vector  fields. 

If  j  Is  a  r(f  one  form  on  Rn,  we  define  the  Lie  derivative  of  m 


with  respect  to  f 


TO 


^  r  •"  rr1 


r: ,7*V»VI'\ r:F;vt  i 


where  *  denotes  transpose  and  3wV3x  and  3f/3x  are  Jacobian 
matrices. 

The  three  kinds  of  Lie  differentiations  just  defined  are  related 
by  the  Leibnitz  type  formula 

Lf<«*>.g>  *  <Lf(«),g>  +  <«,[f,g]>,  (2) 

with  f  and  u  as  before  and  g  as  a  V*  vector  field.  For  a  Vm 
function  h  we  also  have  dL^(h)  »  Lf(dh). 

We  Introduce  the  Kroneckfer  indices  and  the  Brunovs ky  [2]  canonical 
forms.  Consider  the  linear  system 

y  *  Ay  +  Bv  (3) 

where  y  and  v  are  n  vectors  and  m  vectors,  respectively,  and  A 
and  B  are  matrices  of  the  appropriate  size.  For  our  discussion  we 
assume  that  A  and  B  are  time  Invariant,  3  has  rank  m,  and  the  span 
of  {3,AB,  .  .  .,  An-13>  Is  n  dimensional.  Set  r0  »  rank  B, 

r^  »  rank  {8,AB,  .  .  .,  A^B>  -  rank  {B,AB,  .  .  .,  A^-1B>, 

1  <  j  <  n  -  1 . 

*W  w  m 

Obviously,  0  <  r.  <  m  for  0  <  j  <  n  -  1  and 

"  J  "  " 


We  define  the  Kronecker  Indices  <i,<2,  .  .  .,  by  <i  Is  the  number 
of  rj's  that  are  >  1  and  we  remark  that  >  K2  >  •  •  •  >  and 

m 

Z  “j  • n  • 

J.0 

Then  the  system  (3)* is  equivalent  to  a  linear  system  (In  Brunovsky 
canonical  form) 

y  *  Ay  +  Bv  (4) 


n-i 

E 

j«0 
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i. 


A 


’  ^.v.r.'.'/.-.v^: 


and  B  equals 


0  0  . 


1  0  . 


.  .  1 


Later,  In  the  coordinate  system  we  use  (ylfy2,  ....  yn) 

*  .  .  •.  Tn)  and  (vlfV2,  .  .  •  •  vm)  ■  (Tn+i,Tn+2’  *  *  *•  Tn-Hn^* 

4.  Equivalence  of  Systems. 

The  definitions  of  ^-transformations  and  ^-related  systems  for 
single  input  systems  are  given  in  [13].  We  extend  these  definitions 
to  multi-input  systems..  Let  U  be  an  open  neighborhood  of  the  origin 
in  Rn+ffl  space. 

Definition  4.1  *  A  <y- transformation  T  with  domain  U  is  a 
dlffeomorphism  onto  an  open  neighborhood  of  the  origin  in  Rn+ffl  which 
Is  nonsingular  and  maps  the  origin  to  the  origin. 

In  our  theory  U  is  essentially  V  *  Rm,  where  V  is  an  open 
neighborhood  of  the  origin  in  Rn,  the  state  space. 
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.•■V  V'J  •  -  '  .'-V* 


(xt»x2»  •  •  •  i  i *u2 »  •  •  ••  )  and  (y i *y£  *  •  •  •  * 

-V'‘'v* . *.>  (*<T1-TJ . T,.Vi'T« . Tn«) 

the  state  and  control  variables  In  u  and  T(ll),  respectively.  Suppose 

we  examine  two  systems 

Sx:x  -  a(xltx2 . *n.ui.u2 . uj 

and 

S2:y  ■  b(ylty2»  •  •  ••  yn.vi.V2,  .  .  . ,  vffl)  with  respective  state 
trajectory  functions  f  and  *. 

Definition  4.2:  The  system  Si  Is  ^-related  to  the  system  S2 
If  there  exists  a  ^-transformation  T  on  U  such  that  for  each  state 
x0  C  V  and  each  admissible  control  u  the  following  holds.  If  we  let 
y0  *  T(xo,u(0))  and  T(*(t;x0,u),u(t))  ■  (y(t),v(t))  whenever  ♦(t;x0,u) 
Is  a  state  In  U,  then  y(t)  »  *(t;y0,v). 

If  Si  Is  ^-related  to  S2.  by  the  transformation  T  with 
domain  U  we  say  that  Si  Is  T-related  to  S2.  In  fact,  by  the  fol- 
lowing  statement  It  makes  sense  to  say  that  Si  Is  T-eoul valent  to  S2. 
In  [13]  It  Is  shown  that  the  iT-relatlon  for  single  Input  systems  Is 
actually  an  equivalence  relation,  and  the  proof  can  be  generalized  to 
our  present  case. 

We  are  particularly  Interested  In  mapping  (via  a  ^-transformation) 
the  nonlinear  system 

i(t)  •  f(x(t))  ♦  f)  «4(t)«,U(t))  ,  (1) 

l-i  1  1 

with  f (0)  ■  0,  to  the  controllable  linear  system  (4)  In  Srunovsky 
canonical  form  with  Indices  «i,*2,  .  .  .,  «m.  We  set  Oi  -  *i,a2 

’  +  ** . *"i  ».  •  *  «!  *•••♦*■•  "• 

Theorem  4.3:  If  the  system  (1)  is  T-related  to  the  system  (4)  on 
U,  then 

a)  STj/ 3u^  -  0,  j-  ■  1,2,  .  .  .,  n  and  k  »  1,2,  .  .  .,  m, 

b)  the  ra  *  m  matrix  {3T./au.  },  j  ■  n  +  1 ,  n  ♦  2 . n+m 

j  x 

and  k  •  1,2,  ....  m  Is  nonsingular  on  U, 

c)  the  following  partial  differential  equations  hold  on  U. 
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*9j)  *  0 ^  »2 *  •  •  • »  1 ^  ^ »  •  •  •» 

=2  '  1>9»  *  1 . V:  '  ''Vi  +  '• 

.  .  . ,  n  -  1  and  i  *  1 ,2,  .  .  . ,  m, 

<dTitf >  »  Tt+1 ,  l  •  1 ,2 . ai  -  1  .ax  +  1 . 

^  +  1 . Vi  •  ’’Vl  + 

•  •  •  t  n  •  1 » 


m 

(dT0l,f  +  E  ui9i)  *  Tn+i  * 


<dTa2,f  ♦  E  ui9l)  -  Tn+_2  . 


<dT°m*f  ♦  E  ui9l>  •  Tn4ffl  .  J 

Proof.  With  XQ.u.yo.v,^  and  *  as  in  Definition  3.2,  let 

yj(t)  •  Tj(^(t;x0,u),u(t))  fbr  j  •  1,2 . n  and 

Vj  *  Tn+j(<fr(t;x0,u),u(t))  for  j  ■  1,2,  .  .  .,  m.  By  hypothesis  y^  Is 
a  state  vector  of  system  (4)  with  respect  .to  v(t)  and 

3y.  n  aT.  ax.  m  aT.  3u. 

. . . 

A  A 

Since  the  rows  of  Ay  +  Bv  In  (4)  are  Independent  of  au^/at, 
k  ■  1,2,  .  .  .,  m,  then  aT^/au^  *  0,  J  »  1,2,  .  ...n  and 
k  ■  1,2,  .  .  .,  m.  Because  the  transformation  T  is  a  dlffeomorphism 
by  definition  the  matrix  {aTj/au^},  j  •  n  +  l,n  +  2,  .  .  . ,  n  +  m  and 
k  «  1,2,  ....  m  is  nonsingular  on  U. 

From  the  canonical  form  (4)  we  have 

Tj,  *  T^^  ,  £  *  1*2,  •  •  *,®1  “  ^  ,0 1  •  •  •  f  ®2  “  ^  *®2  ^  ^  »  •  •  •* 

°m-i  ’  1  ,cm-l  +  lf  *  *  *•  n  "  1  ’ 
which  Is  equivalent  to 

<<Vf  ♦  Z  »,«,)  ■  V, 


for  the  same  set  of  z.  Since  T«,T?,  .  .  T  .  are  independent  of 

*  “  n 

U’,U2*  •  •  •  ♦  un  *•  Tind 


<dT,,gi)  «  0,1  •  1,2, 


-  1  .-i  +  1 ,  •  •  • .  f:  *  1 


+  1,  .  .  .,  c  -l,o  +1,  .  .  . ,  n  -  1 ,  and 

In- 1  0*1 

1  *  1 1 2  §  •  •  •  i  0| 


<dTt,f>  ■  T  ,t  •  1,2,  .  .  . ,  ot  -  1,®i  ♦  1 c2  -  1,  o2 

+  *  •  •»  ffB„i  *  t  ,0ia-i  +  *  *  •»  n  " 

The  canonical  fora  (4)  also  yields 

*«i  *  <dT«i,f  *  2  Vl>  *  Tn+x  • 

T«!  •  <*\.r  *  £  “,9,>  *  T„n  . 

*  ("a-1'  *  t  Vl>  ’  Tm,  • 

and  the  partial  differential  equations  hold  for  T. O 
Note  that 

(dT^.f )  *  T^j  •  L^(T^),i  *  1,2,  •  .  •,  ®i  •  1  »®i  +  1,  •  •  m  ®2  ■  1 
o2  +  1 ,  .  .  • *  "  1 »  cra-i  +  1 »  •  •  • .  "  -  1 • 

From  this  and  the  Leibnitz  formula  (2)  we  deduce  <dT2,g^>  *  0  Implies 
<dT1,Cf.g13>  *0  for  1  •  1 ,2,  and  <dT3,g^>  •  0  Implies 

<dTj ,(ad2f,g.j )>  »  0  fbr  1  *1,2,  .  .  .,  m,  etc.  Hence,  the  partial 
differential  equations  (5)  become 

<dTl,(adJf,g1)>  •  0,  J  ■  0,1 2  and 

1  *  1 ,2,  .  .  . ,  m  , 

1  >(«) 
<d7Cl+1 »(«dJf,g1 ))  «  0,  j  -  0,1 ,  .  .  . ,  <2  -  2  and  I 

1  ■  1 ,2,  .  .  .,  m  ,  I 


%  •  *  *  ■  •  /v  *  *  *  *  v  *  *>% 


<dTam_1+i,(ad'Jf,gi)>  *  0,  j  -  0,1, 


.  . ,  <  -  2  and 

m 


f  *1*2*  .  •  • »  m  > 


<dVf  *  r  Uj9l>  ■  Tn„ 

1*1 


HI 

r<Vf  *  £  Vl>  *  Tn+a 


(6) 

Concld. 


<4Vf  *  £,  u*9i> 

Since 

To,  •  Lf(Torl) 

\  ■  Lf(T„  ,) 


Tn  *  MVl1 

the  last  m  equations  In  (6)  by  the  Llehnltt  formula  become 

<dT0  .f>  ±  £  u.<dT1,(ad,Cl”1f,g1)>  »  T  . 
i  i«i  1 

(dT0  ,f>  ±  £  u1<dT0*lt(.rfc*-1f.g1)>  »T  . 

2  i«i  1 


«IT„.f>t  L  e,C»T<^.l»l.(ad,t«'1f.91»  •  Tn41D. 

where  +  is  for  <•{  odd  and  -  Is  for  k ^  even,  i  *  1,2,  .  .  .,  m. 

In  order  to  solve  for  u1>u2*  *  *  *'  um  t*iese  e9ua^ons 
the  matrix 

<dTt ,(ad<1’lf,gi)>  •  •  •  <dTl,(adICl  lf,gm)> 

<dT0  +i,(ad'e2  lf»9i))  *  *  *  (dT<j.+i,(ad  2 
1  .  • 

•  • 

•  • 

<dTcm_  +l , (ad*™’ 1  f , g i ) )  •  •  *  <dTam-l+1  *(ad  m 

must  be  nonsingular.  ,,, 


Hence,  the  partial  differential  equations  that  we  solve  to  con 
struct  a  transformation  from  system  (1)  to  system  (4)  are 

(dTi ,'adJf ,g^ }>  »  0,  j  ■  0,1  ,  .  .  . ,  *i  -  ’  and 

i  *1,2,  .  .  . ,  m  , 

^dTcl+l*(*d^f»91 )>  ■  0,  J  ■  0,1,  .  .  .,  *2  -  2  and 

1  ■  1 ,2,  .  .  .,o, 

.(ad-Jf.gi )>  ■  0,  j  ■  0,1 . *n  -  2  and  ► 

”1  ■  1 ,2,  .  .  .,m, 

«Tvf>  i  f  u,<dT1,(.<f'>**f.g1)>  •  Tntl  . 

<«V>  *  jt  u,«T0B_l.t.(.d‘"-1,.g1)>  -  V.  . 

with  the  determinant  of 

<dTcm.i+l.(^1f.9m». 


<dT1,(ad<i"lf,gl)>  •  •  • 
<dTffi+1,(ad,e**lf,gl))  •  •  • 

«VM.l,<«n>rlf.„)>  •  •  • 


being  nonzero. 

The  existence  of  solutions  to  (7)  depends  on  the  Frobenius 
Theorem  mentioned  earlier. 


5.  Existence  of  Trans formatl ons . 


Recall  we  defined  the  following  sets. 

C  *  {gi.Cf.gJ,  .  .  (ad<l‘1f,g1),g2,[f,g2],  .  .  . ,  (ad<2_1f  ,g2) , 

•  •  •»  g^.Cf.g,,].  •  •  ..  Ud*®’^,^)} . 

Cj  •  Cgi.Cf.gi],  •  •  •  »  (adKJ  2f ,gi ) .gj.Cf .92] »  •  •  *.(ad,cj  2f,g2), 

•  •  •.  gm.C^.gn|].  •  •  (adKj"2f,gn)}  for  j  »  1,2,  .  .  .  ,m. 

We  want  to  show  that  our  system  (1)  Is  T-equivalent  to  the  system 

(4)  near  the  origin  If  and  only  if 

1)  the  set  C  spans  an  n  dimensional  space  (l.e.,  the  vectors 
in  C  are  linearly  Independent), 

11)  the  sets  Cj  are  Involutlve  for  j  ■  1,2,  .  .  .,  m  and 

ill)  the  span  of  Cj  equals  the  span  of  O  C  for 

J  *1.2,  •  •  • ,  a, 

all  on  some  neighborhood  of  the  origin  in  Rn.  First,  we  show  that 
condition  1)  Is  necessary  and  sufficient  for  dT1,dT2,  ....  dTn+fn 
to  be  linearly  Independent  and  conditions  11)  and  111)  are  necessary 
in  order  that  a  T-transformatlon  exists.  A  constructive  proof  will  be 
given  for  the  sufficiency  of  conditions  ii)  and  Hi). 

We  compare  the  linear  Independence  of  the  set  of  vector  fields  in 
C  with  the  linear  independence  of  the  gradients  dTlfdT2,  ....  dTn 
assuming  that  Ti,T2,  ...»  Tn+ffl  solve  equations  (7).  Let 
Ci,C2,  . cn  be  constants  and  form  the  vector  field 

a  -  c^i  +  c2[f,gi]  +  .  .  .  +  c0i(adKl‘1f,g1)  +  cffi+1g2 

+  CC1+2^»92^  +-  •  •  •  +  c02^ad  2  f  »g2^  +  •  *  *  +  cffrn_i+l9m 

+  c%-l«Cf^  ♦  •  •  •  +  cn(ad<m‘lf’9m)  -0* 

Taking  the  operation  <*,a)  with  each  dTj,dT2,  ....  dT  and  applying 

•  n 

equations  (7)  and  the  Leibnitz  formula  (2)  until  only 

dTi ,dT_  +i ,  .  .  . ,  dT-.  are  Involved,  we  find  that  the  vector  . 


fields  in  C  are  linearly  independent  if  and  only  if  the  determinant 
of  the  matrix 


<d71,(ad<l”1f,gi ))  .  •  •  <dTi ,(adKl":f ,gn)> 


(d7Ci+1 ,(ad<2"1f,g1)^ 


<dTffi+1,(ad<2*1f,gm)> 


l^dTffti-l+1  »(ad'Cra"lf»9l)>  *  *  *  <dT®ra-in'(ad<ra'lf'9m)M 


1$  nonzero. 

Similarly,  with  constants  bltb2,  .  .  . ,  t>n  we  define  the  one 

form 


6  *  b^dTj  +  b2dT2  ♦  .  .  .  +  b^dT^  *  ®  • 


Taking  the  operation  (9,*)  with  each  vector  field  in  C,  and  applying 
equations  (7)  and  the  Leibnitz  formula  (2)  until  only 
dTi.dT-+l,  .  .  ..  dT*m-  1+l  are  Involved,  we  find  that 
dTi,dT2,  .  .  . ,  dTfl  are  linearly  independent  if  and  only  If  the  deter¬ 
minant  of  the  matrix 


(dT^ad'**^.^))  <dT0i+i,(adK2'1f,gl)> 

•  •  •  <“Tcn,..«-(ad'‘'"'lf-Si)> 

%  * 

•  0  • 

•  •  • 

•  •  • 

<dT1,(ad'ei*1f,gm)  <dTv,  .(ad'^'f.jJ) 

•  •  •  <dT^.!*i'(sd‘"'lf-9n» 

is  nonvanishing. 


Thus,  for  TlfT2,  .  .  .,  T 


dT i ,dT« ,  .  .  . ,  dT 


n+m 


satisfying  (7), 


n+m 


are  linearly  independent  if  and  only  if  the 


vector  fields  in  C  are  linearly  independent  if  and  only  if  the  deter¬ 
minant  of  the  matrix  in  (7’)  is  nonzero.  We  have  used  the  fact  that  in 


T  ■  (Ti,72,  .  .  T^T^,  . 


'  Tn-J  m'*  WW  • 


n+m 


are  functions  of  uj,u2,  .  .  .,  u  .  Also  we  nave  shown  that  to  find  a 

T-relation  between  systems  (1)  and  (4)  we  need  to  solve  equations  (7) 

with  the  matrix  in  (/')  being  nonsinoular. 
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Next,  we  examine  conditions  ii)  and  iii)  under  the  assumption  that 
dTi,dT2,  ....  dIR  are  linearly  independent  (i.e.,  under  the  equiva¬ 
lent  assumption  i)).  The  equations  (d^ ,(adJf ,g. ))  =  0, 
j  *  0,1,  .  .  . ,  <1  -  2  and  i  =1,2,  .  .  .,  m  imply  that  the  vector 
fields  in  span  at  most  an  n  -  1  dimensional  space  at  each  point. 
If  <i  >  <z  we  have  that  the  span  of  Ci  equals  the  span  of  Cj n  c, 
since  the  vector  fields  in  C  are  linearly  independent.  Also  we  have 
that  the  set  Cj  (thus,  the  set  CjO  C)  must  be  involutive  with  d^ 
being  a  nonvanishing  normal  to  the  n  -  .1  dimensional  integral  mani¬ 
fold  of  Ci  H  C  given  by  the  Frobenius  Theorem.  If 
<1  *  *2  =  •  •  •  s  *s  (i.e.,  we  say  appears  in  C  s  times),  for 
some  integer  s  >  2,  then  the  vector  fields  in  Ci  span  at  most  an 
n  -  s  dimensional  space  at  each  point  from  equations  (7)  and  the  linear 
independence  of  dTlf  dT0  +1,  .  .  .,  dT0s  +l .  Again,  because  of  the 
linear  independence  of  vectors  in  C,  the  span  of  C;  is  equal  to  the 
span  of  Cx  n  C.  The  set  Ci  (and  Ci  H  C)  is  involutive  with 
dTi,dTa  +1,  .  .  .,  dT0s  1+1  being  normal  to  the  n  -  s  dimensional 
manifold  of  Ci  n  C  given  by  Frobenius. 

For  our  present  purpose  the  interesting  equations  from  (7)  are 

<dTi  ,(ad^f  ,g^ ) '/  ■  0,  j  *  0,1,  .  .  . ,  Ki  -  2  and 
l  =1,2,  .  *  . ,  m  , 

<dTffi+1 .(ad^f.g^ )>  =  0,  j  =  0,1,  <2  -  2  and 

i  *  '1 1 2 1  •  •  . ,  m  , 


(8) 

<dT0(D-i+l»fadJf*9l^  »  0,  j  =  0,1,  .  .  .,  <m  -  2  and 

1  •  •  •  i  in  t 

From  the  Leibnitz  rule  (2)  we  have 

<dTz,(adjf,gi)>  =‘-<dT£+1,(adj"1f,g1)>  , 

and  this  can  be  used  repeatedly.  Thus,  given  a  j,  1  <  j  <  m,  since 
dTi,dT2,  ....  dTR  are  linearly  independent,  the  number  of  linearly 
independent  vector  fields-  in  C.  is  n  -  (p/m),  where  p  is  the 

J  U 

number  of  equations  in  (8)  for  which  the  k  in  (ad  f,g^)  is  greater 

than  or  equal  to  -  2.  This  is  exactly  the  number  of  linearly 

independent  vector  fields  In  C.  HC.  From  the  set  TltT2,  .  .  . ,  T 
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there  are  n  -  [(n  -  (p/m)]  *  p/m  functions  with  linearly  independent 

gradients  that  are  normal  to  the  intearal  manifold  of  C .  ^  C  (or 

J 

ecuival ently  of  ) . 

w 

Hence,  the  necessity  of  conditions  ii)  and  iii)  are  proved.  We 
now  state  our  main  result  and  complete  its  proof.  We  must  Keep  in  mind 
that  a  reordering  of  the  vector  fields  glfg2,  .  .  .,  g^  has  been  done, 
if  necessary. 

Theorem  5.1 :  The  system  (1)  is  T-equivalent  to  the  system  (4), 
where  the  state  variables  .  .  . ,  xn  lie  in  a  sufficiently 

small  open  neighborhood  V  of  the  origin  in  3Rn,  if  and  only  if  con¬ 
ditions  i),  ii),  and  Iii)  hold  on  V. 

Proof .  The  necessary  part  of  the  theorem  has  been  proven.  Thus, 
we  wish  to  construct  a  T-transformation  given  the  three  conditions 
and  remember  that  our  transformation  will  be  nonsingular  by  i).  Since 
the  span  of  equals  the  span  of  n  C  for  j  *  1 r2,  .  .  ra, 
we  have  that  each  Cj  n  C  is  involutive  and“work  with  only  those 
vector  fields  in  C. 

Before  we  build  T  we  assume  that  the  following  conditions  hold.  - 
Suppose  all  entries  when  evaluated  at  the  origin  In  the  matrix  C7‘) 
above  the  diagonal  are  zero.  After  the  proof  is  completed  we  will  snow 
that  no  generality  Is  lost  In  making  this  assumption.  Hence,  the 
Tj.Ta,  .  .  .,  Tn+ffl  that  we  define  have  linearly  independent  gradients 
and  the  matrix  in  (7')  is  nonsingular  if  and  only  if  the  diagonal 
elements  in  this  matrix  do  not  vanish. 

We  now  construct  a  solution  T  ■  (Tj.Tj,  ....  Tn+ffl)  of  equa¬ 
tions  (7).  Since  Tn+l»7n+2»  •  •  •»  Tn+1„  are  found  by  the  last  m 
equations  in  (7),  we  need  only  find  solutions  Ti,Ta,+1,  ....  Tff  +1 
to  the  partial  differential  equations  (8).  Recall  that  first-order 
linear  partial  differential  equations  are  solved  by  reducing  to  systems 
of  ordinary  differential  equations.  We  now  give  the  ordinary  differ¬ 
ential  equations  to  be  examined  and  then  choose  our  solutions 

•  •  •»  Tam-i+i-  **  should  be  evident  that  these  m  functions 
will  not  be  unique. 

Let  Si  be  the  number  of  times  that  <i  -  1  appears  in  C  (e.g., 

if  <i  •  <z  >  <3,  then  Si  »  2);  s-  be  the  number  of  times  <i  -  2 

appears  in  C,  .  .  .,  s*  be  the  number  of  linearly  independent  vectors 

in  {gi.az,  .  .  .,  g  > ,  which  by  assumption  is  m. 
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We  introduce  real  parameters  tltt2.  .  •  . ,  t  in  the  following 
technique.  We  first  solve 

d*Ui) 

-itrs  (ad  f>9i] 

with  initial  conditions  x(0)  *  0  to  find  the  unique  integral  curve  of 
(adKl"lf,gi)  through  the  origin  in  (xi,x2,  ....  xR)  space.  If 
si  >  2,  we  next  solve  the  system 

dx(t,.t2) 

— dt—  *  lad  1  f-92> 

with  initial  conditions  x(ti,0)  *  x( ti )  to  find  the  solution  with 
x(0,0)  *  0.  We  continue  in  this  manner  si  -  2  more  times  finally 
finding  the  solution  of 

dx(tltt2,  .  .  .,  ts.)  *  -i 

- - ;*(ad  f-  9s,) 

satisfying  x(tltt2,  .  .  .,  ts  *0)  *  x(ti,t2 . 

We  now  approach  the  partial  differential  equations  (8).  Finding 
a  solution  z  to  the  equation  <dzt(adKl"2f .g^)  *  0  is  the  same  as 
solving 

*  (ad^f.g  )  ,  *  0 

1  3tSj+i 

with  initial  conditions  x(tlft2 . tSl,0)  *  xUi,t2,  •  •  •»  t$i). 

We  let.  z  denote  the  unknown  function  for  each  equation  we  consider. 

If  $2  >  1  for  <dz,(ad<1"2f ,g2))  ■  0  we  examine 

37 - *  (ad  1  f.g  )  ,  tt - -0 

^Si+Z  2  oIS1+2 

satisfying  x(t!,t2,  .  .  .,  tSl+1,0)  *  x(tlft2,  .  .  .,  t$1+i).  Doing 
this  s2  times  takes  us  through  the  partial  differentiation  equation 
<dz,(ad'Cl‘2f ,gS2)>  *  0  and  parameter  tSl+s2* 

Next  we  consider  <dz,(ad<1_3f ,g 2 )>  *  0  and  take  the  system 


l-s1+s2+i 


■  (.d'l-’f.g  )  .  rr-^ - 

-1  8lSi+S2+l 


'  more  times 


with  initial  conditions  x(tt,t2 . t$i+s^,0) 

»  x(ti,t2,  .  .  tSl+s.).  We  repeat  this  process  S3  - 
if  s2  >  1  and  end  with  the  system 


dx 


dt 


Si+Sr+S, 


(ad 


«  \  at 

'  •  9$  a  /  *  1  + 

s3 


+s2+s3 


and  parameter  ^s^Sj+Sj  associated  with  (dz,(ad*l"3f ,g$  ))  ■  0. 

Then  we  examine  (dz.Ud*1"^ .g^)  *  0  and  continue  this  process 

in  order,  introducing  all  parameters  tj‘tt2 . tn  and  ending  with 

the  equations  (recall  Si  +  s2  +  .  .  .  +  sKi  »  n) 


dx 


.  _3Z_ 

*  3t 

n 


0 


satisfying  x(tj,t2.  ....  tn_, ,0)  »  x(tltt2,  ....  tn*  ).  In  this 
way  we  treat  (dz.g^)  «  0. 

We  have  constructed  a  mapping  from  FR  .  to  Rn  given  by 

(ti,t2»  *  •  ..  tn)  ♦  (*l (t^ ,t2 »  •  •  •»  tfl)  ,X2(ti ,t2 ,  .  .  .,  tn), 

•  •  •*  •  *  *» 


and  taking  the  origin  to  the  origin.  The  Jacobian  matrix  of  this 
mapping 


3Xx 

3*1 

8X3 

3tl 

3t2 

3tn 

3*2 

3x2 

3x2 

3ti 

3t2 

•  • 

3tn 

3t2 


is  called  the  noncharacteristic  matrix.  When  evaluated  at  the  origin, 
its  columns  are  (ad<l"1f,gi) ,(ad<2"1f ,o2) ,  ....  [f ,gm3 .3i »  •  •  •  .  gffl. 
the  vector  fields  in  C  all  evaluated  at  (0,0,  .  .  .,0).  Thus,  the 
matrix  Is  nonsingular  and  the  above  mapping  (xi(ti,t2,  .  tn), 

x2(tlft2i  .  .  .,  tn),  .  .  .,  xn(ti,t2,  .  .  tn))  Is  a  dlffeomorphism 
on  an  open  neighborhood  W  of  the  origin  In  lRn.  We  let  V  be  a  - 
sufficiently  small  open  set  about  (0,0,  .  .  0)  in  the  image  of  W 


t 


under  the  mapping.  Moreover,  we  can  solve  for  'tltt2 . t  as 

functions  of  xltX2,  .  .  . ,  xfl  by  the  inverse  function  theorem. 

We  now  define  the  functions  T3 ,Tff  +1 ,  ....  Tom.>+i  t0  solve 
equations  (8).  By  design,  each  maps  the  origin  to  the  origin  and  is 
<&m  on  V.  Let  Ti  -  tj  and  note  that  3T1/3t1  •  0,  i  *  2,3,  .  .  . ,  n. 

We  must  show  that  <dTlfq>  ■  0  for  every  element  q  in  C2 n  C  (this 

is  the  same  as  the  equations  involving  Tj  in  (8)).  From  the  Frobenius 
Theorem  we  know  there  are  Integral  manifolds  of  the  vector  fields  in 
Ci  n  C.  By  our  construction,  for  fixed  tj,^,  •  .  •.  as  we  let 

ts1+i,tSl+2 . tn  vary  we  obtain  such  an  integral  manifold.  Thus, 

Tj  *  t2  is  a  constant  on  each  such  manifold  and  <dTj,q>  «  0  for  all 

q  In  Cj  n  C.  Note  that  t2  Is  the  parameter  associated  with  the 

vector  field  (ad*1*^  ,g3).  Next,  we  define  TCl+l  *  t,  where  t  is  the 
parameter  that  was  introduced  when  we  solved  dx/dt  «  (adK2”If,g2).  For 
example.  If  Sj  ■  2,  TGl+1  «  t2  ■  t,  or  if  s2  ■  1,  T„l+1  ■  t3  «  t.  We 
must  show  that  (dT0  +1,q)  *  0  for  all  vector  fields  q  in  C2n  C. 

Since  C2  O  C  is  involutlve,  there  is  a  subdivision  of  the  set  of 
indices  tj,t2,  .  .  . ,  tn  into  2  subsets  S  «  |tltt2«  ....  tp|and 

S  ■  {V+i’V+z’  *  *  *“»  tn}  for  sorae  r  50  th#t  t  c  S  and  for  fixed 
parameters  in  S  we  get  integral  manifolds  of  C2  r\  C  when 
tr+1»tr+2».’  •  •»  vai"y-  Thus»  ^oi+i  constant  on  each  such 
manifold  and  (dT0  +lfq)  ■  0  for  all  q  e  C2HC. 


We  define  T<,2+1  to  be  the  parameter  associated  with  the  vector 
field  (ad<3*1f ,g3) ,  and  continuing  in  this  manner,  we  Introduce 
to3+i»to4+i»  •  •  •»  T0m_1+i  to  solve  (8).  Also,  we  have 

<dTx ,(adICl”1f ,gi)>  t  0,  <dTai+1,(ad'C2“1f,g2)>  *  0 . 

<dT0m_i+l(ad,em“1f,g)n)>  t  0.  By  a  comment  near  the  beginning  of  this 
proof,  we  know  that  our  transformation  is  nonsingular  because  the 
diagonal  elements  in  (7')  are  nonzero. D 

We  point  out  how  to  redefine  gi,g2,  .  .  •»  9m»  1 f  necessary,  so 
that  each  entry  evaluated  at  the  origin  In  (7')  above  the  diagonal  is 
zero.  This  technique  does  not  alter  the  hypotheses  of  Theorem  4.1. 


^  We  take  g2  «  g2  and  replace  g2  by  92  -  e129i»  where  e12  is 

a  constant  so  that  the  vector  fields  (adICl‘1f,g1 )  and 
(ad<1-1f,gi  -  @1292)  have’zero  inner  product  at  the  origin.  We  call 
this  g2  -  *i29i  our  new  92*  Next  we  replace  g3  by 

9  93  -  81391  -  *2392 »  where  e23  and  e23  are  chosen  so  that  the  Inner 
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products  of  (ad*l"‘f ,g: )  with  (ad*:”'f,g3  -  e-  3gi  -  e23g-) 

(ad'C2"if ,32)  with  (ad<2*lf,g3  -  ei3gi  -  e23g2)  vanish  at  (0,0,  .  .  .,  0). 
This  g3  -  «i3gi  -  e23g2  becomes  our  new  g3.  We  continue  in  this  way, 
the  last  step  being  to  take  e.-te,_ . e_  ,  to  make  the  obvious 

lfll  aJn  fit- 1 

vector  fields  orthogonal  at  the  origin,  and  gm  -  e,R|gl  -  e2|Bg2  -  .  .  . 

*  Vi  Al  15  our  n,w  V 

Since  Tt  ■  t3,  dx/dt3  ■  we  have 

<dT1,(ad,C2"lf,g2)>  -  0,  <dT2 ,(adK*’lf ,g3)>  -  0 . 

<dTL ,(adICl~1f .g^))  ■  0,  all  evaluated  at  the  origin.  Because 
Tc1+i  *  t.  dx/dt  «  (ad,C2“1f,g2) ,  we  find  that  <dTai+l,(ad*2"lf,g3)> 

•  0,<dTCl*1.(ad,C2'lf,g4)>  -  0_,  .  .  .  ,<dT0l+l  ,(ad*2’lf .gj)  -0  when 
(xltx2,  .  .  . ,  xfl)  •  (0,0,  .  .  .,0).  Repeating  this  argument  m  -  3 
more  times  we  show  the  entries  In  (71)  above  the  diagonal  vanish  at  the 
origin. 

An  Illustrative  example  Is  In  order. 

Example  5.2:  Consider  the  nonlinear  systmn 


■  •  m 

*1 

sin  x2 

•0* 

'O’ 

*2 

sin  x3 

0 

0 

*3 

m 

X3 

*u 

♦  “1 

1 

♦  u2 

0 

x«. 

x5  ♦  X3  -  x|° 

0 

0 

.X5- 

0 

c 

•  m 

•  f  *  uigi  +  u2g2 

on  Fs. 

Now  we  compute 


0 

m  • 

cos  x2  cos  x3 

V 

•cos  x3 

0 

0 

0 

,  (ad2f,gl)  ■ 

0 

[f.gil  * 

0 

0 

0 

1 

.  0  . 

• 

L  0 

.0. 

Hence,  C  •  {gi,[f,gi],(ad2f,gl),g2,[f»g23}  spans  a  5-dlmensional  space 
on  the  set 
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v  »  |(x1,X2,x3,X4,x5):  -  J  <  x2,x2  <  . 

The  appropriate  Kronecker  indices  in  this  case  are  <i  *  3  and  <2  *  2. 

We  have  C\  n  C  »  (gi .[f tg1Lg2.tf.g2]>  is  involutive  since 
0  ' 
n  x3 
0 
0 
0 

and  all  other  Lie  brackets  vanish,  and  C2  n  C  •  {g!,g2}  is  trivially 
involutive.  Hence,  there  is  a  transformation  which  maps  our  nonlinear 
system  to  the  appropriate  Srunovsky  canonical  form. 

We  exhibit  such  a  transformation  for  (x*  ,x2  ,x3  ,x4,x5)  in  V  by 
the  construction  in  the  proof  of  Theorem  5.1.  The  solution  of 
dx/dtj  *  (ad2f,g1)  with  x(0)  «  0  Is  x^t.)  *  t1,x2(t1)  «  0 .x3 (<tl )  «  0 , 
X4(tj)  *  0,  and  xs(t1)  ■  0.'  Similarly,  for  dx/dt2  *  [f,gi]  with 
x(tlf0)  »  x(tx)  we  find  xi(t1(t2)  »  tl,x2(t1,t2)  *  -t2,x3(t!  ,t2)  »  0, 
x4 ( t^ , t2 )  *  0,  and  xs(t3,t2)  »  0.  For  dx/dt3  *  Cf *923 *  dx/dt4  *  g3 , 
and  dx/dts  *  g2  In  that  order  and  with  the  appropriate  initial  condi¬ 
tions  we  have  x1(t1 ,t2,t3,t4,t5)  -  tl,x2(t1,t2,t3,t4,t5)  *  -t2, 

X3U1  ,t2,t3,t4,t5)  *  t4 ,x4(t^ ,t2 ,t3 ,.t4 ,t§)  *  t3,  and 
x5(t1,t2,t3,t4,t5)  «  t5.  Certainly,  the  noncharacteristic  matrix  Is 
nonsingular  and  solving  for  t* ,t2,t3,t4,ts  as  functions  of 
xi,x2,x3,x4,x5  we  find  t3  ■  xltt2  ■  -x2,t3  ■  x4,t4  *  x3,  and  t5  »  x3. 

Then  one  such  transformation  T  *  (Tl,T2,  .  .  .,  T  )  is 

Ti  “  xi 

T2  «  sin  x2 

T3  *  (cos  x2)sin  x3 

T4  «  x4 

T 5  "  x5  +  xj  -  xj° 

T6  *  (-sin  x2)sin2  x3  +  (cos  x2  cos  x3)(ui  +  x£) 

T7  ■  u2  +  3x2(x5  +  x2)  -  1  Ox j  (sin  x2)  . 
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Siven  the  system  (1)  we  choose  the  Kronecker  indices  in  the  fol¬ 
lowing  way.  First  we  form  the  matrix 

r  ..  3=  ■  ■  •  S* 

lf.ul  ■  ■  ■  tf.S.3 

(ad^f.gj  (ad2f ,g2)  •  •  •  (ad2f,gm) 

•  •  • 

•  •  • 

•  •  • 

adn  1  •f »gi )  (adn  l*,g2)  .  .  .  (adn  *f,gfn)_ 

We  assume  that  all  numbers  we  define  are  constant  in  some  neighborhood 

of  the  origin  In  Fn.  Set  a<)  ■  number  of  linearly  independent  vector 

fields  In  the  first  row;  aj  *  number  of  linearly  independent  vector 

fields  In  the  first  two  rows,  .  .  and  a  ■  number  of  linearly 

independent  vector  fields  in  the  matrix.  Take  r0  »  a0»  ri 

»  si  -  a o,  .  .  . ;  r  ,  ■  n  ,  -  a.  „  and  define  to  be  the  number 
*  w  n»i  n**  n«2  ■ 

of  r j  with  r j  >  1 . 

If  necessary,  we  renumber  gltg2,  .  .  .,  gm  and  see  If  the 
hypotheses  of  Theorem  5.1  are  satisfied  to  determine  If  our  system  is 
transformable. 

In  [3]  the  authors  blended  the  local  results  for  the  single  Input 
case  with  versions  of  the  global.  Inverse  function  theorem  to  yield 
global  transformation  results.  Of  course  analogous  global  results  can 
be  derived  for  the  multiple  Input  case  with  the  local  theory  found  in 
this  paper. 

Another  problem  on  which  the  authors  have  made  progress  is  the 
existence  of  transformations  to  time- invariant  linear  systems  for  time- 
varying  nonlinear  systems  (see  [4]}.  We  can  also  show  that  if  a  non¬ 
linear  system  can  be  mapped  to  a  linear  system,  then  it  must  De 
"reducible1'  to  the  form  (1). 

A  conversation  with  Roger  3rockett,  Narvara  University,  on  the 
multi-input  transformation  problem  proved  valuable  to  the  authors.  He 
wish  to  thank  Eduardo  Sontag,  Rutgers  University,  for  sending  us  a  copy 
of  the  paper  [5]. 


Appendix 

APPLICATION  TO  A  HELICOPTER 


The  helicopter  is  represented  by  a  rigid  body  moving  in  3- 
dimenslonal  space  in  response  to  gravity,  aerodynamics,  and  propulsion. 
The  state, 

x  »  (r,v,C,u)T  CICF3  *  F3  *  S0(3)  *  F3  (A-l 

where  r  and  v  are  the  inertial- coordinates  of  body  center  of  mass 
position  and  velocity,  respectively;  C  is  the  direction  cosine  matrix 
of  the  body-fixed  axes  relative  to  the  runway-fixed  axes  (taken  to  be 
inertial).  The  attitude  C  moves  in  the  Lie  group  S0(3).  The  body 
coordinates  of  angular  velocity  are  represented  by  u. 

The  controls, 

u  •  (uM,uP)TC  UC  F3  *  R  (A-2 

where  u^  is  the  3-axls  moment  control,  that  is,  roll  cyclic  and  pitch 
cyclic,  which  tilt  the  main  rotor  thrust,  and  the  tail  rotor  collective 
which  controls  the  yaw  moment;  and  uF  is  the  main  rotor  collective, 
which  controls  the  main  motor  thrust. 

The  effectively  12-dimensional  state  equation  consists  of  the 
translational  and  rotational  kinematic  and  dynamic  equations: 

f  *  v 

v  *  fF(x,u) 

(A— 3 

c  «  su)c 

u  *  f^(x,u) 

where  fF  and  f^  are  the  total  force  and  moment  generation  processes, 
and  (x,u;  are  defined  by  (A-l)  and  (A-2). 


The  canonic  model  has  Kronecker  indices  {-,4,2,2}  and  the  vari¬ 
ables  are  identified  as  shown  in  Figure  A-l ,  where  •  represents 
a  scalar  Integrator,  y-  C  R4  Is  the  canonic  control  and  t,  is 
the  direction  cosine  matrix  representing  the  heading  of  the  helicopter. 


Figure  A-l.  Canonic  Model  of  (A-3). 


He  do  not  go  into  details  of  the  nonlinear  system  nor  the  con¬ 
struction  of  the  linearizing  transformation  here.  However,  we  mention 
that  we  have  a  method  to  build  an  approximate  transformation  and  the 
accuracy  of  this  technique  Is  Illustrated  through  the  simulation 
resul ts . 

The  experiment  consists  in  automatically  flying  a  trajectory 
which  exercises  most  of  the  flight  envelope  of  the  UH-1H  helicopter 
as  shown  In  figures  A-2  and  A-3. 

Unlike  the  coarse  accelerations  in  figure  A-3,  the  model  accel¬ 
erations  (figure  A-4)  are  smooth,  as  Is  the  vertical  velocity  v3(J  . 

The  second  panel  In  figure  A-4  shows  the  effects  of  neglected  parasitic 
effects  in  the  construction  of  the  linearizing  transformation.  As  can 
be  seen  the  acceleration  errors  are  quite  small  being  less  than  0.5  g. 
The  regulator  controls  these  effects  by  means  of  position  errors. 

The  resulting  horizontal^  error  is  less  than  6  ft;  while  the  vertical 
error  is  below  1  ft. 


Figure  A-4.  System  Response-Canonic  Variables 


The  natural  controls  u  =  (uf1,u?),  shewn  in.  figure  A-5  are  well 
behaved. 


0  100  200  300  400  500 


RELATIVE  TIME,  sec 

Figure  A-5.  Response  Natural  Controls. 

In  summary,  the  performance  of  the  system  is  oood.  Future  tests 
will  exercise  the  system  with  more  taxing  trajectories. 
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ROBUSTNESS  IN-  NONLINEAR  CONTROL 

Renjeng  Su,  George  Meyer  and  L.  R.  Hunt 

Abstract 

A  new  design  methodology  for  nonlinear  plants  using 
transformations  of  nonlinear  systems  to  linear  systems  Is  presently 
being  developed.  It  is  the  purpose  of_this  paper  to  show  that  this 
design  theory  Is  robust.  If  the  linear  system  Is  asymptotical ly 
stabilized  by  applying  appropriate  feedback  {a  well-known  technique), 
then  a  control  to  stabilize  the  nonlinear  plant  Is  easily  computed 
through  that  part  of  the  inverse  transformation  involving  controls. 
Most  Importantly,  all  nearby  plants  (In  the  proper  topology)  are  also 
asymptotically  stabilized  using  this  control.  Lyapunov  functions 
for  nonlinear  systems  can  be  found  using  this  method.  A  short 
discussion  on  the  application  of  this  design  technique  to  the 
automatic  flight  control  of  aircraft  is  presented. 


ROBUSTNESS  IN  NONLINEAR  CONTROL 


Renjeng  Su1 
George  Neyer2 
L.  R.  Hunt3 


I.  Introduction 

Suppose  we  have  a  nonlinear  system  of  the  form  x(t)  •  h(x.u) 
where  h  Is  a  f1  function  of  the  state  xcRn  end  of  the 
(m- tuple)  control  vector  u.  If  we  choose  a  control  vector  uQ  and 
If  h(*.u0)  vanishes  when  x  *  0,  does  the  asymptotic  stability  of 
the  zero  solution  Imply  the  asymptotic  stability  of  equilibrium 
points  of  nearby  control  systems  (In  an  appropriate  topology)  with 
the  control  uQ  driving  these  systems?  We  work  in  some  fixed  set 
in  Fn  which  contains  the  origin. 

In  our  version  of  this  problem  we  have  a  nonlinear  plant  that 
we  wish  to  control.  The  mathematical  model  of  the  plant  is 


m 

x(t)  ■  f(x(t)]  ^  u1  (t  )9i  [x(t )]  .  x  s  (1 

i»l 

where  f,gt,  ...  ,  gm  are  vector  fields  which  are  e“  in  an  open 
set  W  in  Rn  containing  the  origin  and  f(0)  »  C.  We  assume 
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that  there  Is  a  nonsingular  one-to-one  transformation  T  napping 
our  system  (1)  to  a  controllable  1  inoar  system  In  tht  approprlata 
Brunov sky  canonical  form  [1].  Thus,  our  system  (1)  transforms  by 
T:W«R®.  T(M«R*)C  Rn*||*  with  T(0)  ■  0  Into 

y(t)  •  Afly  ♦  afl¥  (2) 

whom  y  and  v  aro  tha  canonical  state  and  control. 

The  design  technique  Is  to  build  a  controller  for  the  nonlinear 
system  by  designing  one  for  the  linear  canonical  system.  For 
aircraft  we  use  the  canonical  form  to  determine  the  control  to  fly  a 
reference  trajectory  and  to  stabilize  the  system  about  this 
trajectory.  Using  the  transformation  (from  nonlinear  to  linear)  and 
Its  Inverse  we  obtain  an  exact  nonlinear  model  follower  [2-4). 

In  this  article  we  use  linear  feedback  to  asymptotically 
stabilize  the  linear  system  about  0,  and  this  in  turn  asymptotically 
stabilizes  the  nonlinear  system  about  the  origin  in  its  state  space. 
We  show  that  all  system  dote  (in  the  topology  we  introduce)  to  the 
mathematical  model  are  stabilized  (asymptotically)  about 
corresponding  equilibrium  points,  and  the  stability  holds  for  any 
trajectory  starting  in  some  fixed  compact  set  in  state  space 
(of  course,  the  usual  1  inearl zatlon  techniques  are  local  in  nature). 
In  this  wey  we  prove  that  our  design  technlcue  is  robust.  At  the 
and  of  this  paper  we  discuss  applications  of  our  approach  to  various 
aircraft  and  some  flight  test  results  which  indicate  the  robustness 
of  our  technique  In  actual  practice. 

Lyapunov  functions  play  a  key  role  in  our  theory.  If  linear 
feedback  Is  used  to  asymptotically  stabilize  the  canonical  system, 
then  It  Is  easy  to  construct  Lyapunov  functions.  It  Is  shown  that 
the  composition  of  these  functions  with  the  transformation  yield 
Lyapunov  functions  for  the  nonlinear  system 

m 

*(t)  •  f[x(t)]  *  ^  ut(t)g5 (x(t)) 
i«l 

(which  we  tfbuld  also  write  as  x  ■  f  ♦  Gu,  S  and  u  having  obvious 
definitions)  with  the  controls  u,,  f  •  1,2 . m,  corresoonding 


to  the  linear  feedback  appliec.  If  a  nonlinear  system  is  sufficiently 
dose  to  the  above  system  (In  the  ('•  topology  on  a  particular 
compact  set  containing  the  origin),  we  achieve  asymptotic  stability 
for  this  nearby  system. 

We  emphasize  linear  feedback  on  the  linear  system,  but  the 
technique  can  be  applied  for  many  asymptotically  stabilizing  controls 
for  which  we  can  find  a  Lyapunov  function  whose  level  sets  (sets 
where  this  function  is  constant)  shrink  to  the  origin  in  the  state 
space  of  the  linear  system. 

Section  2  of  this  paper  contains  a  review  of  the  transformation 
theory  for  taking,  nonl Inear  systems  to  linear  systems.  In  section  3 
we  present  standard  definitions  and  show  how  to  construct  Lyapunov 
functions  using  the  transformation.  Our  main  theory  concerning  the 
robustness  of  the  design. technique  using  transformations  is  proved 
in  section  4.  Since  the  transformation  theory  exists  for  time-varying 
nonlinear  systems  (see  [5-6);  Hunt,  L.  R.,  Su,  R . ,  and  Meyer,  G. 

Time  Varying  Systems,  article  in  progrfts).  we  mention  how  to  extend 
our  results  to  these  systems  in  the  last  section. 

2.  Transformations 

We  are  interested  in  recessary  and  sufficient  conditions  to 
transform  the  nonlinear  system  (1)  to  the  canonical  form  (2). 
Historically,  theorems  in  this  direction  can  be  found  in  references 
(2)  and  (3),  and  in  the’work  of  Kroner  f 7 J ,  Brockett  [8],  and 
Jakubczyk  and  Respondek  (9).  Much  of  the  material  presented  here  is 
discussed  in  references  (10-12). 

We  begin  by  introducing  the  Xronecker  indices  and  Brunovsky 
canonical  forms.  Suppose  we  examine  the  linear  system 

y  ■  Ay  ♦  Bv  (3) 

where  ycRn,  v* F1",  and  A  and  B  are  appropriately  sized 
matrices.  This  system  is  assumed  controllable,  that  is,  the  span 

n.  i 

of  ;S ,  A8,  ...  ,  A  -B)  is  n  dimensional.  We  set 


r  *  rank  B 
o 

r^  »  rank  (8,  A8,  ...  ,  A^B)  -  rankfB,  AB . AJ*B) 
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for  1  S  J  S  n  •  J,  ind  have  that  0  S  i  ie  for  0  s  j  S  n  -  1  and 
53  rj  "  "*  «h*  Kroneckar  Inde*  is  the  number  of 
r j  *  i  •"*  r«“r*  «*«t  *,  2  <j  ...  i  <m  and  fs ml  «  -  n. 

It  1*  Mil  known  Met  system  (3)  Is  equivalent  to  our  linear 
*yata«  In  Brunovsky  canonical  form  (2)  with  a  new  control  »,  such 


/  *  V  *  V 


Here  A  equals 
0  • 


At  0 


C  1  0 
0  0  1 


0  C  0 
0  0  0 


and  8„  equals 
0 


*  »  >*«*«"  I 


■ .  • . ' >. -  -v  «  •  ■'  -  .• 


WWW 


i  ^ 

4 

f  - 

r. 

r 


!• 

c 

i* 

i 

« 

r 


■3 


■i 

b2 


where  B.  Is  the  «,xm  matrix  whose  only  nonzero  element  Is  a  1 

"*"*  '  ♦  kk 

In  the  last  row  and  1  column. 

We  define  the  type  of  transformations  that  are  of  Interest  to 


A  €m  transformation  T  ■  (T*.  Tj . T 


us. 


set  W»R 


in 


»n+m 


n+m' 


maps  an  ooen 


( C*1 .  *2*  »  *n*  u!>  u2» 

space)  onto  an  open  set  In  Fn4in 


u  i 
rn 


(x.u) 


((y>.  y2.  •••  .  y„.  *«.  v2 . vm5  »■  (y*y)  space)  containing  the 

origin,  where  U  Is  an  open  set  In  Fn  containing  the  origin,  so 
that  the  following  properties  hold: 

I)  T(O)  «  0; 

II)  Tj,  T2 . Tn  are  functions  of  x  only 

and  have  a  nonsingular  Jacobian  matrix  In  W; 

III)  T  ,  T  ,  ...»  T  are  functions  of  (x.u)  and  for 

fl+l  fl+z 

fixed  x  In  w,  the  m*m  Jacobian  matrix  of  T  ,  Tn+2>  ...  , 


with  respect  to  u  Is  honsingular ; 

1  v )  Tt  ■  ylt  T,-  y2 . Tn  "  yn  4re  the  State  v*ri*b1es 

And  I-..,,  •  vi»  T,.-,  *  v 2,  ...  ,  T„  •  v_  are  the  controls  for  the 
rm  *  n+z  *  n*tn  m 

linear  time-invariant  system  (2);  In  other  words,  T  maps  system  (1) 
to  systam  (2);  and 

v)  T  ■  (Tj,  T-,  ...  ,  T  )  Is  a  one-to-one  map  of  WxF"1 
onto  an  open  set  containing  the  origin  in  (y,v)  space. 

If  we  fix  the  controls  Tn+I ,  ~n+2,  ...  ,  then 

( u i ,  ui ,  . . .  ,  u  )  *  u  Is  fixed,  the  transformation  T  is 

*  in 

restricted  to  W,  and  the  only  coordinate  functions  involveo  are 


We  need  several  additional  definitions  before  stating  the 
theorem  on  conditions  for  the  existence  of  transformations  from 


V*  vt  «-•  *■  *-•  v*.: 
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reference  [12].  We  Motion  that  a  mat  hod  for  constructing  a 
transformation  is  also  available  In  rafaranca  [12]. 

If  f  and  g  am  fi"  vector  fields  on'  Fn,  the  Lie  bracket 
of  f  and  g  is 


where  ig/»x  and  it /as  are  Jacobian  Matrices.  We  sat 

(ad#f,g)  •  g 
(adlf,g)  -  [f.g] 

(ad:f,g)  «  [f, [f.gll 


(adkf,»)  .  [t,{%tk‘-t.9)\ 

A  collection  of  «"  vector  fields  hlf  h: . hr  on  Rn 

is  ineolutiee  if  there  exist  C"  functions  with 

r 

[h(.bj][x)  •  ^  Y1jk(x)hk{x)  ,  !si,  jSr.ifJ 
k-l 

We  return  to  systee  (1)  and  define  the  sets 


C  •  {Ji.(f.9il.  ...  •  («d  Mi). 
9t*l^»92i*  ...  * 

...  •  9a>ti*9BJ»  •••  •  (Ad  ^>9B)3 
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Theorem  2.1 

There  exists  a  transformation  with  properties  1)  through  v)  if 
and  only  if  on  U 

1)  the  set.  C  spans  an  n  dimensional  space, 

2)  each  set  Cj  Is  involutive  for  j  »  1,2,  ...  ,  m.  and 

3)  the  span  of  equals  the  span  of  C  -for 

j  *  1.2 . m. 

Theorem  2.1  as  stated  in  reference  [121  is  local  in  nature 
(it  holds  for  some  neighborhood  of  the  origin  in  Rn),  but  for  the 
sake  of  simplicity  in  notation  we  have'taken  this  neighborhood  as 
our  set  U.  Global  transformation  results  can  be  found  in 
reference  [11] . 

There  is  also  a  version  of  this  theorem  (Hunt,  L.  R.,  Su,  R., 
and  Meyer,  G.  Time  Varying  Systems,  article  in  progress)  for  time- 
varying  systems  which  we  discuss  later. 

Since  the  main  topic  of  this  article  is  the  preservation  of 
asymptotic  stability  under  small  perturbations  of  transformable 
systems,  we  turn  our  discussion  to  stability  and  Lyapunov  functions. 

3.  Stability 

Let  x  be  an  equilibrium  state  for  the  system  of  differential 
equations  x  *  h(x),  where  h  is  el  in  some  set  W  of  the  state 
space  which  contains  x. 

Definition  3.1 

The  point  I  is  a  staple  equilibrium  if  for  every  neighborhood 
U  of  x  in  Rn  there  is  a  neighborhood  Uj  of  x  in  U  such 
that  every  solution  x(t)  with  x(0)  in  U;  is  defined  and  in  U 
for  all  t  >  0.  If  U[  can  be  chosen  so  that  in  addition, 

1 im  x(t)  ■  x  , 

t-0 


then  x  is  asymptotical!  y  stable.  Given  a  compact  set  K 
containing  x  in  its  interior,  x  is  called  asyraototicall v  stable 
on  K  if  it  is  stable  and  every  solution  starting  in  K  converges 
to  x. 

Definition  3.2 

Let  V:U  -  S  be  a  continuous  function  defined  on  a 
neighborhood  U  of  x,  differentiable  on  U  -  (x) ,  such  that 

(a)  V(x)  ■  0  and  V(x)  >0  if  x  *  i 

(b)  V  <  0  in  U  -  Cxi  . 

Then  V  is  a  strict  Lvaounov  function  for  x. 

Suppose  »e  take  a  linear  feedback  control  and  apply  It  to 
system  (2)  to  stabilize  (asymptotically)  the  system  about  the 
origin,  with  the  control  substituted  we  have  a  linear  system  of 
differential  equations 

y  *  V 

where  the  eigenvalues  of  CQ  have  negative  real  parts.  Choosing  a 
negative  definite  matrix  Q.  the  equation 

C.*P  ♦  PC.  ■  Q  (*  denotes  transpose) 

0  0 


yields  a  unique  positive  definite  solution  P,  and 
V(y)  •  y*Py 

Is  a  strict  Lyapunov  function  (see  ref.  (131,  p.  51).  Ue  remark  that 
the  level  surfaces  of  this  Lyapunov  function  shrink  to  the  origin. 

Now  y,  ■  7,,  y2  ■  Tj,  ...  ,  yn  •  Tr  are  functions  of  x  and 
thus  V  depends  on  x.  Cououtlng,  we  find 


i-i  1 


£ 


i-i 


2M2 


^  'i,  j  ^  vy, 
j«l  J  J-l  j 


which  we  know  to  be  negative  away  from  the  origin  in  y  space. 
Hence,  V  is  a  strict  Lyapunov  function  for  the  origin  in  x-space, 
the  system  being  (1)  with  u  corresponding  to  our  linear  feedback 
control.  We  now  illustrate  how  u  •  (u> ,  u, . u  )  is  computed. 

•  HJ 

Let  ej  -  s2  -  v,  *  <2 . c,  *  *  ‘J  *  1  "• 

From  reference  [12]  we  know  that  ult  u2 . um  and 

vlt  v2,  ...  ,  vm~  are  linearly  related  by  the  equations  (here  <•,•> 
denotes  the  duality  between  one  forms  and  vector  fields) 


<d1Vf>  +]C  ui  <dW  *  ¥i 

1-1 

» 

<dT,2’f>  *]£U1  <dT02'9i>  “  V2 


<dTn'fWEu1  <dV9i 


and  the  coefficient  matrix  for  u^  is  nonsingular  under  the 
conditions  of  Theorem  2.1.  Knowing  our  transformation  and  the 
feedback  controls  v>,  v2,  ...  ,  v^,  it  is  easy  to  compute 
“1.  u2 . V 

Using  Lyapunov's  theorem  and  substituting  ui,  u2.  ...  ,  u^ 
into  system  (1),  the  origin  is  asymptotical ly  stable,  forever, 
if  K  is  a  compact  subset  of  W  containing  the  origin  and  ;K 
(the  boundary  of  K)  is  a  level  set  of  V[y(x)),  then  the  origin  is 
asymptotically  stable  on  K.  This  follows  because  T  (and  T*:) 
map  level  sets  to  corresponding  level  sets,  trajectories  tc 
corresponding  tra jectorles ,  and  the  origin  to  the  correspondi ng 
origin.  We  state  this  as  a  theorem. 


-V.V-VJ  AJ.  'V.V.  • 
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7  h  tor  gw  3.3 

Suppose  we  have  a  transformation  T  mapping  system  (1)  to 
system  \Z)  with  properties  i)  through  v)  holding  on  an  open  set  W 
containing  the  origin  in  x*space  and  we  use  linear  feedback  in 
y-space  to  asymptotically  stabilize  the  linear  system  (eigenvalue: 
having  negative  real  parts).  Any  strict  Lyapunov  function  v(y) 
for  the  linear  systm  is  a  strict  Lyapunov  function  Y[y(x)]  for  the 
nonlinear  system  with  the  controls  corresponding  to  those  of  linear 
feedback.  Moreover,  this  nonlinear  system  has  the  origin  as  an 
asymptotically  stable  equilibrium  point  on  any  compact  set  K  whose 
boundary  is  a  level  set  of  V[y(x)]  and  with  K  C  w. 

Of  course,  by  taking  different  Lyapunov  functions  we  can  find 
other  sees  k  and  possibly  add  to  the  set  of  points  for  which 
solutions  starting  at  these  points  tend  to  the  origin. 

It  is  appropriate  to  Illustrate  this  method  of  constructing 
Lyapunov  functions  by  an  example. 

Example  3.4 

Me  take  the  nonlinear  system  on  R* 


• 

*! 

Sin  x 2 

V 

m 

♦  u 

*2 

1 

O 

_ 1 

1 

with  M  ■  {(xj.xj):  —  /Z  <  x2  <  »/Z) ,  an  open  set  containing  the 
origin  In  R1. 

It  Is  shown  In  reference  (14)  that  the  hypotheses  of  Theorem  2.1 
are  satisfied  end  that  one  transformation  on  W  is 

T,  ■  xi 
Tj  •  sin  x  2 
Tj  »  (cos  x;)u 

“he  canonical  linear  system  is 


o 

0 

m 

♦  V 

yz 

o  c 

Y: 

e 

A 

H4 


ana  v  ■  -2y,  -  2y2  places  the  eigenvalues  at  -1-1  and  i  - 1 . 
We  obtain 


h  0  1  y, 

*  C  / 

yz  -2  -2  y2  0 


we  solve  C  *P  PC  •  Q  to  find 
o  o 


5  11 
*  * 


Thus,  V(y)  *  y»Py  *  |  yt2  +  j  y2y2  ♦  g-  y22  Is  a  Lyapunov  function 


for  y  •  CQy  at  0. 

Ecuatlons  (4)  in  this  case  become 


'd"2,f>  +  u  <dT2,g)  *  v  or 

(cos  x2)u  »  -2y,  -  2y.  »  -2xi  -  2  $in  x2 

Solving  for  u  we  have 
2x, 

u  *  *  «T7T  - 

wnlch  takes  our  original  nonlinear  system  to 


XjJ  .[•  cosTJ  *  2  tan  *:j 
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The  function 


Vty(x)]  *  j  x,J  ♦  j  *i  sin  x2  ♦  j  (s1n2'x2) 

is  e  strict  Lyapunov  function  for  the  origin.  We  have  derived 
asymptotic  stability  for  any  compact  set  K  with  a  boundary  which 
is  a  level  set  of  V{y(x}<  and  which  is  contained  in  w. 

'■ie  check  the  eigenvalues  of  the  linearization  of  the  nonlinear 
system 


•  •  • 

*1 

sin  x2 

*2 

• 

* 

cos  X2 

at  the  origin.  The  Jacobian  matrix  of  the  right-hand  side 
evaluated  at  the  origin  is 

m  • 

0  1 

•2  -2 

Hence,  the  eigenvalues  are  exactly  those  of  the  linear  system  after 
linear  feedback  controls.  We  snow  later  that  these  eigenvalues  arc 
preserved  In  general,  which  is  an  important  step  in  our  robustness 
theory . 

4.  Robustness 

We  will  now  prove  that  the  method  of  design  using 
transformations  of  nonlinear  systems  to  linear  systems  is  robust. 
First,  we  need  i  topology  on  the  set  of  systems  of  the  form 
i  *  h(x),  that  Is,  <  topology  on  the  set  of  vector  fields. 

Let  V(K)  ie  the  set  of  vector  fields  on  a  compact  set 
K  C  3?n.  we  define  the  ;;l-norm  ilhlh  cf  a  vector  'ield  hcVUJ 
to  be  the -least  upper  bound  of 

1  h  ( x  i  ,  IQh(x)!! 
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for  xeK.  Here  |*i  is  the  usual  norm  on  Rn,  Oh  is  the  Jacobian 
matrix  of  h  (we  also  denote  this  by  eh/»x)  and  li  •  i!  is  the 
usual  operator  norm  on  the  set  of  nxn  matrices  L(JRn)  (see 
ref.  [15],  page  82);  that  is,  for  ScL(Rn),  II  S  li  *  max  { |  Sx  j :  |x  j  Si). 
Two  vector  fields  in  V(K.)  are  close  in  this  topology  if  and  only  if 
their  corresponding  coordinates  and  the  first  partial  derivatives  of 
these  coordinates  are  close  in  the  usual  topology  of  uniform 
convergence  on  K. 

For  an  open  set  II  C  u"  and  hel'(W),  the  C1  vector  fields 
on  U,  we  can  take  the  norm  II  hllj  defined  in  the  same  way  as  above 
with  the  understanding  that  II  h  llj  *  •  can  occur. 

Next  we  examine  the  linearization  of  the  nonlinear  system  (1) 
about  0 

m 

x(t)  «  Fx  ^  u1 < t )gi 0  _ 

i»l 

where  F  Is  the  Jacobfan  matrix  of  f  at  0,  and  where  g^c  is  the 
constant  part  of  g(,  1  •  1,  2,  ...  ,  a,  at  least  one  of  which  is 
nonvanishing  by  assumption.  We  rewrite  equations  (4)  In  the  vector 
notation 

u  •  H,*‘(x)v  ♦  Hj(x) 

Linear  feedback  v  ■  Hv  yields 
u  •  Hj*1 (x)Hy  +  Hj (x ) 

Since  f(0)  *  0,  the  linear  part  of  Hj(x)  is  of  the  form  Qx. 
Ignoring  Qx,  since  T(C)  *  0,  the  only  terms  of 

/  *  (y; .  ft.  •••  .  yn)  ■  <tj.  t2 .  Tn)  that  have  an  effect  on 

the  eigenvalues  at  0  in  system  (5)  are  those  contributed  by  the 
linear  terms  of  T,,  T,,  ...  ,  T  in  the  x,,  x-,  ...  ,  x 
variables. 

We  are  now  able  to  prove  the  following  lemma. 


leim  *.l 

If  1 Inear  feedback  is  aoplied  to  tna  linear  system  (2),  then  the 
resulting  eigenvalues  are  the  sam  as  those  of  the  linearization  (S) 

of  the  nonlinear  system  (1)  at  the  origin  with  oj,  u2 . 

substituted. 

Proof 

It  Is  apparent  from  the  discussion  preceding  the  statement  if 
this  theorem  that  we  have  reduced  Its  proof  to  that  of  a  linear 
system  where  the  transformations  Involve  linear  coordinate  changes 
and  feedback.  Ue  take  the  linear  system 

x  •  Fx  *  Gu 

and  let  z  ■  C*lx  and  u  ■  w  ♦  Qx  »  Dw  *  Ox.  Our  linear 

system  In  z  coordinates  with  control  vector  w  becomes 

i  ■  C"l(F  -  GQ)Cz  ♦  C*>G0w 

Setting  w  •  Hz,  which  Is  a  feedback  on  z-specc,  we- obtain 
i  -  {C-‘(F  ♦  S0)C  ♦  C-iGDHJz 
Plugging  u  »  Qx  *  OHCx"'  into  Fx  *  Gu,  we  have 
x  -  l(F  ♦  GO)  *  G0HC*Mx 

Since 

det.Ul  -  [C*‘(F  *  SQ)C  *  C-'GDH))  - 

det.UI  -  [(F  *  GQ)  *  GDHC* ’• )  1 

our  theoren  is  proved.  2 

The  statement  in  the  lemma  is  not  surprising  when  we  realize 
that  since  f (0)  •  0, 


(adkf,9j)(0)  *  Fk9l 


for  *11  k,  where  F  and  g^:  are  as  In  equation  (5).  Concerning 
eigenvalues  at  the  origin,  the  effect  of  the  transformation  T  on 
the  linearization  of  system  (1)  is  the  same  as  that  of  a 
transformation  that  takes  system  (5}  to  Brunovsky  canonical  form. 

We  take  the  following  results  from  Chapter  16  of  reference  [IS]. 
Theorem  <.2 

Let  h  be  a  g1  vector  field  on  W  C  Rn  and  xcW  be  an 
equilibrium  of  *  *  h(x),  such  that  Dh(x)  is  invertible.  Then  there 
exists  a  neighborhood  U  C  w  of  x  and  a  neighborhood  Tt  C  c  (w) 
of  h  such  that  for  any  hell  there  is  a  unique  equilibrium  Zell  of 
z  •  h(z).  For  any  c  >  0  we  can  choose  n  so  that  |z  -  x|  <  c. 
Theorem  <.3 

Suppose  that  x  is  an  equilibrium  point  for  x  *  h(x)  and 
Oh(x)  has  eigenvalues  with  negative  real  parts.  Then  in  Theorem  4.2 
It  and  U  can  be  chosen  so  that  if  hell  ,  the  unique  equilibrium 
tell  of  i  *  h(z)  satisfies  Dh(z)  has  eigenvalues  with  negative  real 
parts. 

The  following  well-known  result  is  found  in  Chapter  9  of 
reference  [151. 

Theorem  4.4 

Let  xcW  be  an  equilibrium  point  for  x  *  h(x)  with  Dh(x) 
having  negative  real  part  eigenvalues.  Then  there  is  a  neighborhood 
U'  C  H  of  x  such  that  all  solutions  of  x  »  h(x)  starting  in  U' 
converge  to  x;  thus,  x  Is  asymptotically  stable. 

This  theorem  Is  a  shortened  version  of  the  first  theorem  In 
Chapter  9  (see  ref.  [15].  If  every  eigenvalue  of  Dh(x)  has  its  real 
part  less  than  -c,  c  >  0  it  follows  from  the  proof  of  that  theorem 
that  the  set  U'  depends  on  c  and  continuously  on  h(x). 

tt  is  clear  that  if  the  hypotheses  of  Theorems  4.2  and  4.3  are 
satisfied,  we  can  choose  the  set  71  in  Theorem  4.3  so  that  there 
exists  a  constant  c  >  0  so  that  the  eigenvalues  of  Oh(z)  have  real 
parts  less  than  -c  for  all  he 71  .  Since  we  are  working  with  the  <!* 
tocology  on  the  set  of  systems,  the  way  in  which  we  choose  the  sets  U' 
in  Theorem  4.4  depends  continuously  on  elements  in  71. 

If  the  conditions  of  Theorem  4.3  are  net.  then  we  can  choose 
71  and  an  open  neighborhood  0  C  W  of  x  so  that  the  following 
properties  hold: 


(a)  for  any  hell  Chert  is  a  unique  equilibrium 
zcO  of  i  ■  h(z)  and  Oh(z)  hes  eigenvalues 
with  negative  reel  parts,  and 

(8)  for  tny  hcJl  ell  solutions  of  z  ■  h(x) 
starting  in  0  converge  to  z. 

In  our  application,  h  will  be  f  *  where 

u,,  1  si  sm.  are  controls  corresponding  to  t  linear  feedback  giving 
eigenvalues  with  negative  real  parts  in  the  linear  system  (2),  and 
*  »  0.  Of  course,  we  assume  that  the  nonlinear  system  (1)  is 
transformable  for  *cW  to  the  linear  system,  as  in  Theorem  2.1. 

From  section  3,  theorem  3.3,  we  have  a  striet  Lyapunov  function  V 
for  0  and  the  system 

M 

**'*5Zvi 

i*i 

and  we  take  K  to  be  a  compact  set  contained  in  U  whose  boundary 
Is  a  level  set  of  ¥(x).  Me  now  state  our  main  result. 

Theorem  <.5 

With  controls  u«,  u2 . ua  and  compact  set  K  as  just 

mentioned,  there  exists  an  open  neighborhood  H  of  f  ♦  u^ 
in  i'{K)  such  that 

(•')  for  any  hcJl  there  Is  a  unique  equilibrium 
point  icK  if  i*  h(z),  and 

(S')  for  any  h*Jl  all  solutions  of  i  •  h(x) 
starting  in  K  converge  to  z;  that  Is, 
z  Is  asymptotically  stable  on  K. 

Proof 

Lama  *.l  implies  that  the  systom  i  •  f  *  u^,  with  u< 

corresponding  to  our  asymptotically  stabilizing  linear  feedback 
in  the  linear  system,  has  eigenvalues  with  negative  real  parts  in  its 
linearization  at  3.  From  a  previous  discussion  we  know  that  there 
Is  a  neighborhood  11'  in  vOO  (  V(W)  in  orevious  results  can  be 
replaced  by  V! K) J  of  f  *  T1^  u^  and  an  open  neighborhood  0 
in  Jlfl  of  0  with  our  earlier  conditions  (a)  and  (s)  holding  for 
H '  on  0.  . 

Choose  a  compact  annular  region  a  C  K,  whose  outer  bounoary 
is  the  level  set  »K  (the  boundary  of  K),  and  whose  Inner  boundary 


is  another  level  set  of  V  contained  in  0  and  distinct  froa  0. 

We  recall  here  that  our  level  sets  of  V  do  shrink  to  the  origin. 
Because  V  is  a  strict  Lyapunov  function  of  *  ■  f  +  u^, 

we  have  V  *  <dV,f  +  Ujj^  >  <  0.  Since  we  are  working  in  the  e1 
topology  on  v(K),  we  can  choose  71  C  71  •  so  that  V  •  <dv,h>  <  0 
on  0  for  all  hell  and  z  is  the  unique  equilbrium  point  for  h 
in  K.  The  condition  that  <dV,h>  <  0  implies  that  tne  solution 
curves  of  i  ■  h(z)  intersect  the  level  sets  of  V  in  a 
transverselly  (the  tangents  to  the  solution  curves  are  not  tangent  to 
the  level  sets  at  points  of  intersection)  and  must  be  novlng  from 
outside  to  inside,  since  those  for  the  nearby  system 
x  «  f  ♦  u^  are  transversing  in  that  direction. 

Hence,  if  we  start  with  any  point  in  K  with  a  solution  of 
z  «  h(z),  we  can  reach  the  set  0-  Then  condition  (b)  guarantees 
that  we  converge  to  z,  and  statements  (a1)  and  (6‘)  are  proved. C 
In  practice,  suppose  the  plant  is  A  system  z  ■  h(z).  where 
hell  is  driven  by  the  controls  ui,  ut,  ...  ,  u_.  Then  Theorem  4.5 

lu 

proves  the  robustness  of  our  nonlinear  design  technique. 

He  conclude  this  section  with  a  remark  concerning  systems  which 

depend  on  parameter  values,  lot  system  (1)  depend  on  a  parameter  e, 

and  assume  for  e  •  a  we  have  a  transformable  system.  Suppose  we 
0 

design  with  the  parameter  value  »0  and  transformation  T(eQ). 

We  construct  a  Lyapunov  function  V(y)  as  before  and  compose  with 
T(8q).  The  sensitivity -of  the  asjmiptotic  stability  of 

R 

x(t)  «  f (x(t) ,9)  utlt)gf!x{t),8) 

1-1 


can  be  viewed  by  examining  the  e  derivative  of 
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where  is  the  k  component  of  g1(x(t),e]  and  similarly 


for  fk. 


5.  Time- Varying  Systems 


Consider  the  time-varying  nonlinear  system 


i(t)  «  f(x.t)  ♦  u4(t)a^«,t) 


with  f ,  jj,  gj  •  ...  .9,1  being  vector  fields  on  Bn  which  are  t “ 
In  all  arguments.,  Also  assume  that  f(G,t)  •  0  for  all  t.  Ue  want 

a  transformation  7  •  (Tj,  T2,  ...  ,  T _ )  that  maps  an  ooen  sat 

In  R  t(*i .  *j.  ...  .  Uj.  uj . uB,t)  space]  containing 

the  origin  In  Rn<"  onto  an  open  set  In 

R"*".r(yi.  *j.  ...  .  yn.  *•.  *2 . V'  *  (Tl*  T: . Tn-ws^ 

space]  containing  the  origin  and  takes  the  time-varying  system  (6) 

to  the  time-invariant  linear  system  (2K  Ue  also  want  conditions 
analogous  to  1}  through  v)  in  section  2  to  hold,  except  here  we  must 
ram  amber  that  T  Is  a  function  of  tlM  t.  Thus,  in  each  of  1] 
through  v)  we  add  the  phrase  "for  every  t." 

Our  Lie  derivative  Is  now  replaced  by  a  time-varying  Lie 
derivative.  Suppose  f  and  g  are  Cm  time-varying  vector  fields. 
Then 


(r°'.9)  ■  9 

(rlf.g)  •  (ad'f.g)  ♦  & 


(rkf.g)  •  ir‘f.(rk*1f.g)] 

In  this  cisa  we  define  new  sets  C(t)  and  Cj(t), 

j  »  1,  2,  ...  ,  in,  by  replacing  (adl<f,g)  In  our  definitions  for  C 
k 

and  Cj  by  (.’  f,g).  The  following  result  is  found  In  Hunt,  L.  A., 
Su,  and  Neyer,  S.  {Time  Varying  Systems,  article  in  progress). 
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Theorem  5.1 

There  exists  a  transformation  T  satisfying  our  new  conditions 

1)  through  v)  if  and  only  if  on  some  neighborhood  W  of  the  origin 
in  Fn 

1)  the  set  C{t)  spans  an  n  dimensional  space  for 
each  t, 

2)  each  sat  Cj(t)  Is  involutive  for  j  *  1.  2,  ....  m 
and  eacn  fixed  t,  and 

3)  the  span  of  Cj(t)  equals  the  span  of  C^(t)  n  C(t ) 

for  j.«  1,  2 . m  and  each  t. 

It  Is  shown  in  Hunt,  L.  R.  et  al .  (article  in  progress)  that 
the  transformation  T  is  constructed,  with  t  behaving  as  a 
parameter . 

We  apply  linear  feedback  to  the  time-invariant  linear  system  (2) 
to  asymptotically  stabilize  (with  real  part  negative  eigenvalues) 
this  system.  We  substitute  the  corresponding  controls  into 
equation  (6)  using  equations  from  Hunt,  l.  R.  et  al .  (article  in 
progress)  similar  to  those  In  equation  (4). 

Let  V(y)  be  a  Lyapunov  function  for  the  linear  system  (2)  as 
in  section  3  and  compose  this  with  the  transformation  T  to  obtain 
v (y(x.t)J .  Since  V(y(x,t)J  <  0,  a  solution  in  xj ,  x- ,  . . .  ,  xR,  t 
space  is  passing  through  the  level  sets  (the  inverse  images  of 
the  level  sets  of  V(y)  under  T)  transversal ly  and  from  the  outside 
to  the  inside.  Suppose  there  are  positive  definite  functions 
Vj(x)  and  V2<x) (V| (x)  has  continuous  first  partial  derivatives, 

V ^ ( 0)  *  0,  V - ( x )  2  0,  i  »  1,2)  on  some  compact  set  K  in 
xt,  x2,  ...  ,  xfl  space  with  the  origin  In  its  interior.  Suppose 
also  that  the  transformation  T  applies  to  K  for  all  t  and 
V,(x)  &  V(y(x,t)]  S  Vj(x)  for  xcK  and  all  t.  By  Theorem  4.2 
from  reference  (13)  we  know  the  origin  is  uniformly  asymptotically 
stable  for  the  system  (6)  with  the  controls  uj,  u2,  ...  , 
substituted.  In  fact,  any  solution  starting  at  a  point  in  the  region 
bounded  by  a  level  set  of  Vly(x.t))  with  xcK  for  all  t  must  tend 
to  the  origin  in  x-space. 

There  are  many  results  concerning  stability  of  time-varying 
systems  that  we  could  mention.  However,  we  wish  to  make  the 
following  remark  concerning  robustness  of  time-varying  systems. 
Practice  dictates  that  a  theory  is  to  be  applied  in  finite  time. 
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Let  us  assume  that  the  transformation  T  applies  to  an  annular 
region  A  about  the  t-axl i  whose  outer  boundary  and  Inner  boundary 
are  nontrivial  level  sets  of  V(y(x,t)).  Suppose  we  restrict  this 
region  to  a  finite  time  Interval,  say  tO,tfl] ,  tQ  >0.  If  a  system 
i  •  h(*,t)  Is  sufficiently  close  to  system  (6)  (with  ult  u2.  ...  ,  uB 
applied)  In  the  obvious  Cl  topology  for  vector  f1el<$  on  our 
truncated  annulus  A,  then  the  solutions  of  x  ■  h(x,t)  starting  in 
this  set  travel  transversaily  through  the  level  sets  (It  intersects) 
contained  In  a  moving  from  outside  to  Inside.  This  parallels  the 
related  results  for  t1me*1nvariant  systems  In  section  4. 

In  conclusion,  we  have  shown  in  this  paper  that  the  process  of 
stabilizing  a  transformable  nonlinear  system  by  stabilizing  the 
linear  system  to  which  It  maps  is  robust.  If  the  mathematical  model 
and  the  actual  nonlinear  plant  are  sufficiently  dose,  our  design 
scheme  using  transformations  Is  valid,  and  Lyapunov  functions  can 
often  be  found. 

The  design  technique  discussed  has” been  applied  to  several 
aircraft  of  Increasing  complexity.  The  completely  automatic  flight 
control  system  was  first  tested  on  a  OHC-6.  The  reference  trajectory 
used  in  the  flight  test  exercised  a  substantial  part  of  the 
operational  envelope  of  the  aircraft.  Despite  disturbances  and 
variations  In  plant  dynamics,  the  system  performed  well  (see 
ref.  (161 } .  Next  the  technique  was  applied  to  the  Augmentor  wing 
Jet  STOL  Research  aircraft  and  the  successful  flight  tests  are 
reported  in  reference  (3).  Methods  for  providing  pilot  Inputs  to 
tnls  design  were  examined  in  reference  [17).  Application  of  this 
design  scheme  to  the  control  of  an  A-7  for  carrier  landing  and 
testing  In  manned  simulation  Is  reported  In  references  [18]  and  [19]. 
The  design  methodology  Is  currently  being  applied  to  the  UH-IH 
helicopter,  again  with  the  substantial  portion  of  the  operational 
envelope  of  this  aircraft  being  used. 
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Global  Transformations  of  Nonlinear 

Systems 
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Ahuruct  —  Recent  mulls  have  establisned  necessan  and  sufficient  con¬ 
ditions  for  a  nonlinear  svslein  of  the  form 

i(0-/(«('))  +  -(')s(v(t)). 

with  AG)  m  0.  to  be  locally  equivalent  in  a  neighborhood  of  the  origin  in  R" 
to  a  cnnrrollabW  linear  system.  We  combine  these  results  with  several 
version*  of  the  global  inverse  function  theorem  to  prove  sufficient  condi¬ 
tions  for  the  transformation  of  a  nonlinear  system  to  a  linear  system.  In 
doing  so  we  introduce  a  technique  for  constructing  a  transformation  under 
the  assumptions  that  (g.  [/.»]. span  an  n  -dimensional 
space  and  that  {g.[f.  jJ.-  •  -.(ad*"*/,  S»  is  *n  invoiutive  set. 


1.  Introduction 

w  E  CONSIDER  nonlinear  systems  of  the  form 

*(r)m  f{x{t))+u(t)g(x{t))  (1) 

w  here  /  and  g  are  <2*  vector  fields  on  an  open  set  U  in  R" 
containing  the  origin  and  /( 0) «*  0.  The  problem  of  interest 
to  us  is  finding  sufficient  conditions  on  U,  /.  and  g  $0  that 
there  exists  a  c*  transformation  T  -  (T,.  Tz,-  •  from 

an  open  set  in  R"*1  to  an  open  set  in  R**1  with  the 
following  properties: 

1)  7*0) -0; 

2)  T,.  Tz,-  ■  -,T„  are  functions  of  x,.  x2.-  •  • ,x„  only  and 
T  maps  U  in  R"  into  T,.  72,  •  ■  •,  T„  space  with  a  nonsingular’ 
Jacobian  matrix: 

3)  is  a  function  of  x,,  x2,-  •  ■  .x„,  u  which  can  be 
inverted  as  a  function  of  u  and  where  (x,.  x2.-  •  -,x„) e  U : 

4)  r„  72.-  •  •.  r,*,  satisfy 

Wi 

n-r, 

ti-T,.,;  (2) 
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5)  T  is  one-to-one  (with  (T,.  72,-  ■  ■  ,T„)  being  one-to-one 
on  U). 

That  is,  when  can  we  map  a  nonlinear  system  in 
x,.  x2>-  •  -,xn,  u  space  with  (x,.  x;.-  •  -,xn)  S  L  to  a  linear 
system  in  Tt,  72,  •  •  • ,  Tn~  ,  space  talcing  the  system  ( i )  to  the 
system  (2)?  In  (2)  we  think  of  Tnri  as  the  control,  and' our 
linear  system  is  in  integrator  form  on  /t-dimensional  space. 

Krener  [1]  solved  the  problem  of  which  nonlinear  sys¬ 
tems  can  be  transformed  to  linear  systems,  using  a  change 
of  variable  without  feedback.  Brocketi  [2]  gave  sufficient 
conditions  for  a  real-analytic  nonlinear  system  with  equi¬ 
librium  point  at  the  origin  to  be  locally  equivalent  (using 
coordinate  changes  and  feedback)  to  a  linear  system  in 
integrator  form.  The  third  author  and  Cicolani  [3J.  [4] 
presented  sufficient  conditions  for  a  nonlinear  system  (per¬ 
haps  time  varying)  in  block-trianzular  form  to  be  trans¬ 
formed  to  a  linear  system.  These  results  are  presently  being 
applied  in  the  design  of  automatic  flight  control  systems 
for  aircraft. 

The  references  just  mentioned  led  the  second  author  [5] 
to  prove  necessary  and  sufficient  conditions  for  a  local 
diffeomorphism  (mapping  origin  to  origin)  to  exist  which 
carries  system  (l)  to  system  (2).  The  transformations  in¬ 
volved  contain  both  feedback  and  coordinate  changes  and 
are  more  general  than  those  found  in  [2].  Proofs  depend  on 
the  solution  of  an  overdetermined  system  of  linear  partial 
differential  equations. 

We  combine  these  results  with  global  inverse  function 
theorems  and  techniques  in  partial  differential  equations  to 
prove  global  theorems  for  transforming  a  nonlinear  system 
to  a  linear  system.  Local  results  tell  us  that  there  is  a 
neighborhood  of  the  origin  which  is  mapped  under  a 
transformation,  but  give  us  no  information  about  its  size. 

A  brief  history  of  global  inverse  function  theorems  is 
appropriate.  Hadamard  [6]-[8]  proved  the  following  result. 
A  c*  map  F:  R"  -» R"  is  a  diffeomorphism  onto  R"  if  and 
only  if  its  Jacobian  matrix  is  nonsingular  for  each  x  s  R" 
and  F  is  proper  (i.e.,  inverse  images  of  compact  sets  are 
compact).  Palais  [9]  gave  a  proof  of  this  theorem,  and 
certain  variations  of  inverse  function  results  are  presented 
in  [101. 

The  question  of  the  existence  of  global  inverses  is  trtaied 
in  the  area  of  systems  theory  by  Wu  and  Descer  [11]  and 
by  Kuh  and  Hajj  [12].  With  regard  to  the  problem  of 
global  observability,  we  find  the  papers  of  Fitts  (13). 
Griffith  and  Kumar  [14).  and  Fujisawa  and  Kuh  [IS], 
Applying  the  results  of  Fujisawa  and  Kuh  and  of  3erger 
and  Berger  [10].  Kou  ei  al.  [16]  presented  theorems  on  the 


T*A-ss'0*.M.*r:C'Vj  of  '■online.**  systems 

topic  of  global  observability.  using  inverse  functions.  For 
recent  results  or.  global  inverse  functions,  we  refer  to  the 
papers  of  Sandburg  [17]  and  Miller  [  1  SI. 

Given  vector  fields /  and  g.  we  denote  the  Lie  bracket  of 
/  and  g  by  {/.  j].  and  define  inductively 

{sd'-f.g)- [/.[/.si]."-- 
(ad‘/.g)  -  [f.(ad‘~'f,g)}. 

Given  system  (1),  we  suppose  that  the  n  x  n  matrix  with 
columns  g,[f,  g].- •  -.(ad""1/.  g)  (which  in  this  paper  we 
call  the  controllability  matrix,  even  though  for  nonlinear 
systems  controllability  can.  certainly  depend  on  other  Lie 
brackets  than  these  (19>-[21j)  is  nonsingular  at  every  point 
in  3".  and  g.[j\  g],-  •  g;  axe  involutive  on  3". 

We  introduce  a  certain  matrix  with  the  property  that  if  it 
satisfies  the  ratio  condition  found  in  [16].  we  show  the 
existence  of  a  transformation  7*»  (7",,  T:,- •  * ,  7^_ , )  from 
J?"*1  tc  R""1  with  (7*, .  r,.-  •  -,TJ  napping  3"  to  R"  (and 
having  a  nonsingular  Jacobian  everywhere  on  R")  such 
that  the  system  (1)  is  mapped  to  the  system  (2).  Under 
these  assumptions  the  map  is  one-to-one,  and  we  have 
glooal  invertibility. 

I:  U  is  an  open  subset  of  R"  containing  the  origin,  we 
give  sufficient  conditions  for  there  to  exist  a  transforma¬ 
tion  r-(r,.r,.--,r..!)  with  mapping  u 

one-to-one  onto  its  image. 

In  all  cases,  the  proof  of  the  existence  of  our  transforma¬ 
tions  is  constructive  in  nature,  which  should  be  usefui  in 
future  applications. 

II.  PRELIMINARIES 

In  this  section  we  discuss  some  basic  definitions,  a 
classical  result  from  differential  geometry,  and  the  basic 
ideas  of  the  theory  in  [5],  We  begin  by  introducing  three 
kinds  of  Lie  derivatives  and  a  Leibnitz-type  relation  be¬ 
tween  them.  An  enlightening  discussion  of  these  Lie  deriva¬ 
tives  and  their  applications  in  system  theory  is  presented  in 
the  paper  of  Hermann  and  Kxeaer  [22].  For  basic  refer¬ 
ences  we  suggest  [23]— (25},  and  [26} 

If  /  and  g  are  3*  vector  fields  on  R"  (actually  on  any 
differentiable  manifold  A/),  we  define  the  Lie  bracket  of  f 
and  g 


if.  1 1 


-Ur-U, 


where  dg/dx  and  df/dx  denote  n  x  n  Jacobian  matrices. 
This  L-.e  bracket  [f.g]  is  also  a  vector  field  on  3"  and 
represents  the  Lie  derivative  of  one  vector  field  with  re¬ 
spect  to  another.  Of  course,  we  can  also  discuss  successive 
Lie  brackets  [/.[/.  g]j.  [g.[f.  g]].  etc.  We  define 

[adxf.g)m[f.g] 

[/•(/•*]] 


1  adkf.  g )  -  [f .{odl~'? .  g\\. 


A  set  of  ix  vector  fields  ,/,.  /'..  •  ■./.,  or.  R~  ;o.lec 
:nvoiu:.ie  if  there  exist  cx  functions  y, „i.v.  such  t.ra. 

r 

[/./](*)  *  £  y.jt(x)fJx).  1 

A  ■  I 

This  definition  leads  to  a  version  of  the  famous  Frober.ius 

theorem. 

Theorem  2.1:  Let /,./;.•••,/,  be  an  involutive  collection 
of  3®  linearly  independent  vector  fields  on  R*.  Given  any 
point  x0  SR",  there  exists  a  unique  maximal  /--dimensional 
3*  submanifold  S  of  R*  containing  x0  such  that  the 
tangent  space  to  S  at  each  x  £  5  is  the  space  spanned  by 
/,(jc). /-(*).•  •  ,/,(x);  that  is,  S  is  the  unique  integral 
manifold  of /,./-.••• ,/.  through  x0. 

We  now  define  the  Lie  derivative  of  a  function  with 
respect  to  a  vector  field,  which  takes  functions  to  functions. 
Let  /  be  a  3*  vector  field  and  h  a  3*  function,  with 
gradient  dh  on  R".  Then  the  Lie  derivative  of  h  *:th  respect 
to  f  is 

L,(h)-(dk.f ) 

where  (-,-)  denotes  the  duality  between  one-forms  and 
vector  fields  (the  cotangent  bundle  and  tangent  bundle  if 
one  is  working  on  a  manifold  A/).  This  duality  is  easily 
understood-if  we  set 


If /is  a  3“  vector  field  on  R"  and  u  is  a  3*  one-ferm  on 
R\  i.e.,  u  » tj,dx,  ■+•  vzdx.  +  •••  -r  u„dxm,  u,  being  3* 
functions,  we  have  the  Lie  derivative  of  u  with  respect  to  f 


where  •  denotes  transpose  and  du'/dx  and  df/dx  are 
Jacobian  matrices.  Such  a  Lie  derivative  maps  one-forms  to 
one-forms. 

The  above  three  kinds  of  Lie  differentiation  are  related 
by  a  Leibnitz-type  formula 

£/(«.*>  •<£/(“)•  s>  -<«•[/•$]} 

with  /  and  w  as  before  and  g  a  3"  vector  field.  Also. 
dLj(h)-  L{(dh)  with  h  a  3*  function. 

We  examine  the  problem  of  finding  a  map  with  nonsin¬ 
gular  Jacobian  matrix  of  the  system  on  S* 

x(r)-f(x(i))+u(t)g(x(r))  (I) 

with  f(0>  -  0  to  the  linear  system 


From  [5]  necessary  and  sufficient  conditions  :':r 
existence  of  such  a  map  are  that 

.13  .  .  ...... 


i)  the  controllability  matrix  (?.[/.  g].  •  .( >id"~ '/.  e ): 
has  rank  n  in  some  neighborhood  of  the  origin  in  3"  (with 
variables  ac , ,  v:.-  •  -.x,,).  and 
u)  the  set  o:  vector  fields  (g.[/.  g].- •  ■.{ad”''-/,  g)}  is 
involutive  in  some  neighborhood  o''  the  origin  in  R". 

Note  that  by  i)  g.[f.g),---.{ad"~-f.  g)  are  linearly  in¬ 
dependent  and  by  li)  there  is  an  n  -  1 -dimensional  integral 
manifold  of  this  involutive  set  by  the  Frobenius  theorem. 

The  question  arises  ns  to  how  the  existence  of  a  local  (in 
the  neighborhood  of  the  origin)  invertible  transformation  T 
is  shown  in  [5]  under  Assumptions  i)  and  ii).  Such  a  map 
Tm  (7,.  r2,- •  must  satisfy  the  following  partial 

differential  equations: 


+ 


iZL,+iZL,  • 

dx/'  +  dx/2  ' 


i  -  1,2,-  •  •  ,n  —  1, 


i  +  “?i)+Tr(/i  +  u3z)  + 


which  we  write  as 


0.  i»l,  2, 

< dT„f)-Lf{T,)-T, 

<dr, ,  /  -r  «g>  -  L/+„t  (rj  -  r„. , . 


1, 


Note  that  7",,  are  functions  of  x,,  x2,-  •  -  ,xR  only 

and  that  r„_,  is  a  function  of  x,.  x2,-  •  -,x,  and  u. 

Now  by  our  Leibnitz  formula 

-  £.,<^7*,.  *> *]> 
<drJ,g>-<L/(drI).g> 

-o+<dr, .[/,[/, g]]> 

-  (dTi,(ad1f ,  g)) 
and 

?)  -  (Lf(dT„.t).g) -(-\)"'(dT).(adn-'f.S)). 

Thus  if  we  know  f,.  then  r;.  T},-  ■  ■ .  r„_ ,  can  be  found  by 
Lie  differentiation.  This  fact  is  also  pointed  out  in  [27],  but 
no  conditions  for  the  existence  of  Tt  are  given  there.  Thus 
(J)  becomes 


( dTt.(adlf.g )>  -0. 


'dT^.f*  ug)  -  r„.,. 


It  is  shown  in  [5)  that  T..  TV  ■  •  ■ .  r„_ ,  have  linearly 
independent  gradients  if  and  only  if  the  controllability 
matnx  g].-  •  ad"'/,  g);  has  rank  n  if  and  only  if 

we  have 


(dT,.{adkf.g))  =  Q. 

<dr1.(cd-,/.g)>-o. 


k  -  o.i.- 


If  the  last  equation  in  (6)  is  satisfied,  then  we  can  solve 
for  u  in  the  last  equation  in  (5)  since  (dT„,  g)  - 
(- \)’'~x{dTx,{adn~[f,  g))  as  before.  This,  together  with 
the  facrthat  7",.  7";,- •  -,T„  are  functions  of  x,.  x;,- •  •  .x„, 
only  turns  our  problem  of  finding  a  global  transformation 
T  into  one  of  finding  a  global  solution  of  (6).  With  this  m 
mind,  we  turn  to  the  question  of  solvability  of  paruai 
differential  equations. 

III.  Partial  Differential  Equations 

We  shall  solve  the  overdetermined  system  of  linear  par¬ 
tial  differential  equations  (6).  For  an  introductory  discus¬ 
sion  of  this  topic,  we  refer  the  reader  to  the  book  of  John 
[28]. 

We  introduce  the  parameter  s  and  begin  our  construc¬ 
tion  of  a  3*  solution  7",  of  the  system 

(dT„(adkf.g))m  0.  * -0.1. •••,/! -2 

<<fr„(«r-'/,g)>-o.  (6) 

First  we  solve  for  all  s  e  R  the  system 

with  initial  conditions  x(Q)  -  0  to  find  the  integral  curve 
of  the  vector  Field  (ad"-1/,  g)  through  the  origin  in  n 
space.  We  remark  thai  we  could  start  with  any  vector  field 
that  is  linearly  independent  of  g.[/,  g],-  •  g). 

Next  we  solve  for  all  r,  S  R  the  system 

with  initial  conditions  x(r,0)  -  x(r).  To  solve  the  partial 
differential  equation  (dTv(ad”~:f.  g))  -  0  we  must  find  a 
function  T,  so  that  dT^/dt ,  -  0;  thus,  we  have  reduced  to 
a  system  of  ordinary  differential  equations.  We  then  ex¬ 
amine  for  all  f;  e  R  the  system 

with  initial  conditions  x{s.  r,. 0)  -  x(s,  r, ).  To  scive 
■  dT^(ad"~'‘f .  g))  »  0,  we  show  the  existence  of  a  7,  w-.th 
dT./du-  0.” 

We  repeat  this  argument,  with  the  last  step  being  to 
consider  for  all  r„_,  SR  the  system 


with  (ad°f.g)  -g 

% 


having  initial  conditions  x( s,  !,.■■■  ,r„.:.0)  •  x{s.  r,.- 
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For  {dT,.  g)  -  0  wff  need  a  function  7,  with 
*T,/5r„_, -0. 

For  our  solution  we  choose  T,  ~  s  (actually  any  £® 
function  of  s  which  vanishes  at  (0.0,-  •  -.0)  "and  with 
nonvanishing  derivative  will  work).  Since  the  set 

ii’iJ’  jL' '  *.(gs "  */•  ?)}  is  involutive.  for  each  fixed  r  the 
map 

^  m  ("t^*  f|»*  •  * *—  i ) ■  *:(*.  t,.-  •  •,rll_i),-  • 

-  x»(s<  lt<‘  ’ ' 

defines  an  integral  manifold  of  this  involutive  set.  Hence 
r,  *  r  does  solve  the  desired  equations  as  a  function  of 
Ji  r  i » *  •  • ,  Jj,  _  i  since  it  is  constant  on  every  such  integral 
manifold  and 

iH  iH  9Ti 

all  vanish. 

Tne  important  question  to  answer  is  when  can  the  map  F 
be  inverted  in  the  sense  that  we  solve  for  r,  ,  as 

functions  of  Before  addressing  this  problem, 

we  shew  that  the  last  equation  in  (6) 

{dTx,{od'-'f,g))-  0 

is  solved. 

Computing,  we  have 


neighborhood  of  the  origin  on  which  the  man  f  ;$ 
ible. 

Thus  conditions  for  showing  the  existence  of  a  ziofca 
transformation  T  -  ( T, .  7, .  •  •  ■ ,  7„ . , )  can  be  interpreted  in 
terms  of  the  matrix  (7). 

Before  moving  on  to  global  considerations,  we  examine 
the  Jacobian  matrix  of  (7„7;.  with  respect  to 

s,  t„-  ■  Equations  (6)  imply  that 

dT,  dT ,  ST, 

dt,  *  5rJ>"15‘._1 

are  all  zero.  Applying  the  Leibnitz  formula  once  to  (6)  we 
find 

(dT2,  (ad  */,  g))  *»  C.  <  »0, 1, ■••./» -3 

(dTl.(ad--f,g))mO. 

Thus,  we  have 

dTx  dT  dT 

at,'  dt-/  '  ot„_, 

must  vanish.  Similarly, 

(dTj,{adkf,g))m  0,  *-0,1, ■••,11-4 

<rfr„(«f-*/.l)>-  0 

imply 


>  t — dx,  +  -~drj  +  •••+  — 

5x,  5x2  J  5x 

57^  5r  57,  5r  , 

'  dx  dx^'  +  TTJr*'* 


dT,  ds 
+■  m —  ~r—dxm 
ds  dx.  ’ 


VI  I  , 

If  (.dT,,{ad"~'f,g))»Q  since  dT,  and  (adm~Kf,  g)  are 
both  nonzero,  we  have  that  ( adm~xf,g )  must  be  tangent  to 
an  integral  manifold  of  {*,(/.  g),- •  -,(adm~lf,  g)},  con¬ 
tradicting  the  fact  that  the  controllability  matrix  has  rank 
n. 

Hence,  our  problem  is  solved  once  we  know  that  we  can 
find  j, /„•••,/,_,  as  functions  of  x„x2, •••,*„.  This  de¬ 
pends  on  the  Jacobian  matrix  of  F 


dx, 

2*i 

dx, 

ds 

3i i 

Eh 

ih 

dx. 

as 

dt\ 

1 

e 

being  nonsingular  (this  is  called  the  noncharacteristic  con¬ 
dition  in  panic!  differential  equations).  By  design,  this 
matrix  is  the  controllability  matrix  along  the  integral  curve 
°f  .Si  through  the  on  sun.  Hence,  there  is  an.  open 

*  '  V  _  I*.  .  -  •  V  *f*.  VU  -  A  fc.*  •  %.*  U  ..*1  A*.  U 


573  „  57,  57, 

Continuing  in  this  manner,  we  find  that  the  Jacobian 
matrix  of  7,,  7,,-  •  -,7„  with  respect  to  s,  r„- •  has  all 
entries  above  the  diagonal  zero. 


IV.  Global  Transformation  Results 

We  give  sufficient  conditions  for  a  global  transformation 
(of  the  type  of  interest  to  us)  to  exist  from  a  nonlinear 
system  to  a  linear  system. 

Our  theory  depends  on  the  Jacobian  matrix  (7)  of  the 
map  F  that  we  constructed  in  the  last  section.  We  refer  to 
this  matrix  as  the  noncharacteristic  matrix  because  of  its 
application  in  partial  differential  equations.  Note  that  this 
matrix  does  not  depend  on  the  7,  map,  but  only  cn 
the  functions  x,(s,  r|t- - x.(r,  r„  ■, 

x.(j.r„- 

The  first  result  we  need  can  be  found  in  (16). 

Theorem  4.1:  Suppose  that  there  is  a  map  k’\  R*  —  3" 
which  is  differentiable  with  Jacobian  matrix  J{  x ).  If  there 
exists  a  constant  c  >  0  such  that  the  absolute  values  of  the 
leading  principal  minors  A,,  A..-  •  -.A,  of  J( x )  satisfy 


|A-j 


li 


for  all  x  s  KVthen  H  is  one-to-one  from  3"  onto  3". 

The  condition  stated  on  the  absolute  .alues  of  leading 
orincinai  minor*  r.iii-A  «h#  — • .,  .-<<•» .... 


Theorem  4.2:  Assume  that  the  controllability  matrix 
of  system  (1)  is  nonsingular  on  R\  the  set 
{£.[/.  ••.(ad"-2/.  g )}  is  involutive  on  R".  and  the 

noncharacteristic  matrix  satisfies  the  ratio  condition  on  R  \ 
Then  there  exists  a  0®  transformation  7-  (7*,.  7..-  ■  •. 
7„_,)  with  the  following  properties. 

1)  7(0)  -  0; 

2)  7,.  7,.-  •  -,Tn  are  functions  of  x2,-  •  -,x„  only  and 
the  rtx  n  Jacobian  matrix  is  nonsingular  at  each  point  of 

R"; 

3)  7^,,  is  a  function  of  (x,,  x2,- •  -,xK,  u)  which  can  be 
inverted  as  a  function  of  u  and  where  (jc,,  x2,-  •  -,x„)  e  R*; 

4)  T  maps  the  system  (1)  to  the  system  (2); 

5)  the  map  ( T ..  72,-  •  -,T„)  is  one-to-one  on  R"  and  T  is 
one-to-one  on  R”+1. 

Proof:  By  Theorem  4.1,  the  map  F  whose  Jacobian 
^matrix  is  the  noncharacteristic  matrix  is  one-to-one  from 
R"  onto  R".  Thus,  we  can  globally  solve  for  s,  r|t- • 
as  functions  of  je,,  x2>- •  -,x„,  and  by  the  construction  of 
7-(7,.7,,---,7„*()  in  the  preceding  section,  the  trans¬ 
formation  (7,, r2>- •  -,Tn)  is  defined  on  all  of  R".  More¬ 
over,  since  the  controllability  matrix  has  rank  n,  the  map 
(7,,7:,- •  -,T„)  is  nonsingular  on  R"  as  mentioned  previ¬ 
ously.  Properties  1),  3),  and  4)  follow  by  the  design  of  T. 

We  know  that  the  transformation  (T,,  72,-  ■  -  ,T„)  has  a 
nonsingular  Jacobian  matrix  with  respect  to  r,  r,,-  •  -,r„_ , 
and  that  all  entries  above  the  diagonal  are  zero  by  com¬ 
ments  in  Section  III.  By  construction  we  also  have  7,  -  s,  T2 
is  a  function  of  s  and  r,  only  with  dT2/dtx  «  0  in  R  V  • 
7„_  |  is  a  function  of  j,  /,,*  only  with  dTl)_l/dtll_2 

•  0  on  R",  and  T„  satisfies  dT„/dt„_l»Q  on  all  of  R*. 
This  impUes  that  (T,,  T2,-  •  -,7„)  is  one-to-one  on  R".  The 
fact  that  7- (T,, 7,,- •  -,T„+X)  is  one-to-one  on  R"*1  fol¬ 
lows  from  (-1  r-,<dr„(ad'->/,g))«  +  <dr„,/>-TB<1. 
[see  (5)  and  (6)]. 

Example  4.1:  Consider  the  nonlinear  system  on  R2: 
fx.l  Uxf  +  e'i  +  xj-l]  [01 


-/(*(0)+«(0f(*(0). 

Computing,  we  find 


which  is  linearly  independent  of  g  on  R2.  We  first  solve 
dxy/ds- -(**:  +  !)  with  x,(0) - 0  and  dx2/ds-0  with 
x2(0)m0  to  obtain  x,m  —2s  and  x2  —  0.  Next,  we  solve 
dxK/itm  0  with  x,(r.0)--2j  and  dx2/dt-\  with 
x-(r.O)  -  0  to  obtain  x,  - -2r  and  x~  -  t. 

'The  function  f,  -  s  certainly  satisfies  (6)  as  a  function  of 
s  and  r.  The  noncharacteristic  matrix  is 

-2  O' 

.0  1  , 

which  fulfills  the  ratio  condition  with  <  -  1/2.  Hence  our 
transformation  is  defined  on  all  of  R-.  the  Jacobian  matrix 


of  (T|,  Tz)  is  nonsingular  everywhere.  (7.  T:)  is  one-to-one 
on  R:,  and  T  -  (T,,  T;.  73)  is  one-t.  -one  on 
Since  7,  —  s,  we  have 


os  dx . 


-  -  4(2r2  +  e' -  1  +  /). 

The  Jacobian  matrix  of  (7,,  7,)  with  respect  to  (s.i)  is 

[  1  0  ] 

which  satisfies  the  ratio  condition  on  R2  with  t  - 1/2. 
Thus,  the  map  is  also  onto  R 2. 

In  Theorem  4.1,  the  same  conclusion  holds  if  the  ratio 
condition  on  J(x)  is  replaced  by  the  ratio  condition  on 
AJ(x)  for  some  nonsingular  constant  n  x  n  matrix  A  (see 
[16]). 

Corollary  4.1:  Suppose  that  there  exists  an/tXn  nonsin¬ 
gular  constant  matrix  A  such  that  A  multiplied  on  the  right 
by  the  noncharacteristic  matrix  satisfies  the  ratio  condition 
on  R".  If  the  controllability  matrix  of  system  (1)  is  nonsin¬ 
gular  on  R"  and  if  the  set  {g,[f,g],---.(adH~zf.g)}  is 
involutive  on  R",  then  the  conclusions  of  Theorem  4.2 
hold. 

Given  a  nonlinear  system  (1),  it  is  often  possible  to 
determine  that  there  is  not  a  global  transformation  7  with 
(7|,  T2,  -  •  •.  7„)  mapping  R" in  a  one-to-one  manner  without 
going  through  the  process  of  showing  that  one  cannot  be 
constructed.  Remembering  that  7„_ ,  is  the  control  in  (2), 
we  write  that  linear  svstem  as 


+  7",+i 


<AT+vB 


where  the  definitions  of  7,  o,  and  B  are  obvious. 

Lemma  4.1:  Under  a  transformation  7  the  set  of  points 
where  /  and  g  are  linearly  dependent  must  map  by 
(7,,  72>-  •  7„)  to  the  set  of  points  where  AT  and  B  are 
linearly  dependent. 

Proof:  The  set  of  points  where  A  7  and  B  are  linearly 
dependent  is  defined  by  T2  -  0.  7,-0.  7,-0.  If  x  is  a 
point  in  R"  such  that  /(x)-cg(x)  (or  some  constant  c, 
then  from  (4)  we  have 

(dT,.  g)  »  0,  f-1.2,- 


which  implies  that  (dT,,  cg)(x)  -  0  or  (dT,.f)(x)  -  0. 
Then  (dT,,  f)  i  -  1,2,- •  •,«  -  1,  giving  us  7,-0. 

7j  -  0,  •  •  • .  7„  —  0  at  x.  Q’.E.D. 

Tne  proof  of  this  lemma  actually  shows  the  invariance  of 
the  linearly  dependent  sets  under  the  kind  of  transforma¬ 
tions  we  consider.  In  [19j.  [20],  and  (2!)  the  importance  of 
these  sets  in  controllability  is  demonstrated.  Given  a  point 
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in  the  1: nearly  dependent  set  of  x  -  f  -  ug.  feedback  can 
be  used  to  take  this  point  to  an  equilibrium  point. 

Lemma  4.1  can  be  applied  to  show  the  impossibility  of 
having  a  global  one-to-one  transformation  in  certain  cases. 
Example  4.2:  'Ve  take  the  nonlinear  system 


■/(*(0)+«*(0*(*(0) 

on  R:.  Using  the  techniques  of  [23],  it  can  be  shown  that 
this  is  a  controllable  nonlinear  system  on  R1.  Is  there  a 
global  transformation  T - (7,.  7:,  73)  with  a  nonsingular 
.  Jacobian  matrix  and  with  (7,.  3*,)  mapping  R:  in  a  one-to- 
one  fashion  so  that 


f,-r2 

W,? 

The  set  of  points  where  /  and  g  are  linearly  dependent 
consists  of  the  two  straight  lines,  x2  -  0  and  x2  - 1.  In  the 
linear  system  the  vector  fields 


and 


are  linearly  dependent  if  and  only  if  72  -  0.  There  is 
certainly  no  one-to-one  map  talcing  two  straight  lines  to 
one.  Hence,  this  system  is  not  globally  transformable  in  a 
orte-to-one  sense. 

It  is  possible  to  construct  a  transformation  in  a  neigh¬ 
borhood  of  the  origin.  The  solution  to 


dx 

ds 


-l/.fl 


with  x,(0)  -  0  and  x 2(0)  -  0  is  x,  -  -  r  and  x2  -  0.  Solving 


with  x,(r.O)  -  -  r  and  x.(r.O)  -  0,  we  have  x,  -  -  s  and 
x2  « t.  The  noncharacteristic  matrix  is 


and  this  satisfies  the  ratio  condition  on  R1.  Thus,  our  map 
(x,(r,  i).  x:(r.  r))  is  a  one-to-one  transformation  of  R: 
ontoR:. 

The  controllability  matrix 


has  rank  2  on  {(x,,  x:):  x:  <  1/2}  From  [5],  we  know  that 
a  transformation  exists  in  a  neighborhood  of  the  origin  in 
( x( ,  x; )  space.  Setting  7,  -  s.  x,  «  -  r.  and  x-  -  i.  *e  have 
the  transformation  T,  ■  -  x,.  7.  -  -x-  +  x:.  and  -  (1 
—  2x;)u.  which  exists  on  the  set  {(x,. x.) ■  x;  <  1/2). 
Moreover,  the  Jacobian  matrix  of  ( 7,.  7- )  is  nonsingular  on 
this  set  and  (7,.  7;)  is  one-to-one  there. 


This  example  suggests  the  following  coro'.Lr.  to  Theo¬ 
rem  4.2  (and  Corollary  4.1).  the  proof  of  which  fellow  j  that 
of  the  theorem.  Let  U  be  an  open  set  in  ?. "  containing  the 
origin  and  /  and  g  bt  c*  on  R". 

Corollary  4.2:  Assume  that  the  controllability  matrix 
of  system  (1)  is  nonsingular  on  U.  the  set 
{ g.[f ,  gj,-  ■  ■  ,(ad"~'lf.  g)}  is  involutive  on  and  the  non¬ 
characteristic  matrix  (or  the  nor.characteristic  matrix  with 
a  premultiplication  by  an  n  x  n  nonsingular  constant  ma¬ 
trix  A)  satisfies  the  ratio  condition  on  R".  Then  there  exists 
a  <2®  transformation  7«  (7;,  72,-  •  *,  TH„ ,)  with  the  prop¬ 
erties  l)-5)  of  Theorem  4.2  holding  where  R*  is  replaced 
by  U. 


V.  Other  Transformation  Results 

Suppose  that  / and  g  in  (1)  are  6*  on  R\  The  proof  of 
Theorem  4.1  in  [16]  applies  to  any  n-dimensional  open 
(possibly  unbounded)  rectangle  R  in  (x,.  x:.-  •  -  .xj  space 
with  sides  parallel  to  one  of  x,  -  0.  x-  »  0.-  •  -  .x„  •  0  and 
containing  the  origin  [in  this  case  we  have  H  is  cne-to-one 
onto  the  image  H(R.)\.  If  U  is  an  open  subset  of  R" 
containing  the  origin  and  U  C  F~  \R)  (the  inverse  image 
of  R  under  the  map  F  whose  Jacobian  i*  the  noncharacter¬ 
istic  matrix)  and  if  the  noncharacteristic  matrix,  with  a 
possible  nonsingular  premuhiplication,  satisfies  the  condi¬ 
tion  on  R,  then  a  result  similar  to  Corollary  4.2  can  be 
proved. 

We  are  interested  in  assumptions  other  than  the  ratio 
condition  which  can  be  applied  to  the  noncharacteristic 
matrix.  The  following  result  is  found  in  [16]. 

Theorem  5.1:  Suppose  that  there  is  a  map  H:  R"  -*R" 
which  is  defined  on  an  open  convex  subset  Q  of  R*. 

a)  If  H  is  differentiable  with  Jacobian  matrix  J(x),  and 
if  there  exists  a  constant  nXn  nonsingular  matrix  A  so 
that  AJ(x)  is  positive-definite  for  all  x€fl.  then  H  is 
one-to-one  from  Q  onto  J/(Q). 

b)  If  H  is  continuously  differentiable  with  Jacobian 
matrix  J(x),  if  2  is  bounded,  and  if  there  exists  a  constant 
n  x  it  nonsingular  matrix  A  so  that  det  AJ{x)  >  0  for  all 
xsQ  and  AJ(x)+{AJ(x))9  has  nonnegative  principal 
minors  for  all  x  €  Q,  then  H  is  one-to-one  from  C  onto 

my 

In  our  application  of  this  theorem  we  assume  that  Q 
contains  an  open  neighborhood  of  the  origin  in  (s.  r..-  •  •. 
f„_i)  space.  The  proof  of  our  next  result  is  obvious  from 
previous  considerations. 

Theorem  5.2:  Let  Lbe  an  open  set  in  R"  containing  the 
origin  and  with  U  c  f~\ Q).  Suppose  that  the  controllabil¬ 
ity  matrix  of  system  (1)  is  nonsingular  on  L.  the  set 
{$•[/.  gj.  •  •  ■  .(ad"~lf,  g))  is  involutive  on  i.  and  the  non- 
characieristic  matrix  satisfies  a)  or  b)  in  Theorem  i.l  Then 
there  exists  a  transformation  satisfying  the  following  con¬ 
clusions. 

1)  7(0) -0: 

2)  7,.  7.,-  •  7,  are  functions  of  x,..v;.-  •  -  ..tr  or.iy  and 
the  n  x  n  Jacobian  matrix  is  nonsingular  on  V: 


3)  r„_.  is- a  function  of  jc , ,  x,,- ■  u  which  can  be 
inverted  as  a  function  of  u  and  where  (x,,x2,-  •  -.x,)  €  U\ 

4)  T  maps  the  system  (1)  to  the  system  (2); 

5)  the  map  (7',,7‘2.-  •  •,  r„)  is  one-to-one  on  U  and  T  is 
one-to-one  on  U  xR. 

This  result,  using  condition  a). 


and  £/ *  {(.t,,  x2):  x2<1/2)  can  be  applied  to 
Example  4.2.  If  we  let  Q  be  any  bounded  square  about  the 
origin  in  (r,  t)  space.  U  be  the  same  square  in  (jc,,  jc2) 
space  intersected  with  {(x,,  x2):  x2  <  1/2),  and 


jhen  condition  b)  applies. 

,  At  a  CBMS  Conference  in  1978  at  the  University  of 
California,  Davis,  Brockett  discussed  problems  concerning 
the  mapping  of  nonlinear  systems  and  a  construction  like 
that  given  here.  We  remark  that  our  Assumption  ii)  in 
Section  II  is  weaker  than  a  similar  assumption  in  [2].  In 
that  paper  the  hypothesis  is  that 

[(***/.*).(*/'/.*)]  -  T.  CiiaJV.g). 

i-O 

0£k,j£n-2 

where  d  -  max ( j,  k)  and  the  c,  are  functions.  For  example, 
the  system  " 

Xil  fsinx2l  fcosx, 

i  =»  .  +  u 

x,  i  0  1 

*■  J 

does  not  satisfy  this  condition,  but  is  transformable  in  our 
theory  since  ii)  is  trivially  satisfied. 

We-  remark  that  Theorem  4.2  (and  similar  results)  re¬ 
quires  the  construction  of  the  noncharacteristic  matrix. 
Explicit  constructions  are  possible  for  block- triangular  sys¬ 
tems  [3],  [4],  Ford  is  working  on  budding  transformations 
using  the  symbolic  computation  of  the  M.I.T.  MACSYMA 
Program.  This  appears  to  be  theoretically  possible  for  those 
systems  that  satisfy  Brocketl's  [2]  conditions  and  bilinear 
systems  satisfying  the  assumptions  of  Su  [5].  Also,  other 
systems  can  be  handled,  and  numerical  techniques  are 
being  investigated. 
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n -Dimensional  Controllability  with  (n  —  I)  Controls 

LOUIS  R  HUNT 

A  hit  racl — L*f  M  be  i  connected  real-analytic  .1 -dimensional  manifold, 
/•*,•••  be  complete  real-analytic  vector  fields  on  Af  which  are 

linearly  independent  at  some  point  of  W,  and  it,.-  be  real-valued 

controls.  Consider  the  controliabilify  of  the  system 

n  —  | 

*(0* /(*('))+  2  «, (')*.(■*('))•  *(0)=x0eAf. 

1*1 

Necessary  and  sufficient  conditions  are  given  so  that  tWs  system  is 
controllable  on  any  simply  connected  domain  D  contained  in  Af  on  which 
/,.•  •  are  linearly  independent.  These  conditions  depend  on  the 

computation  of  Lie  brackets  at  those  points  where  f.g,.- ■  ■  .gA~,  art 
linearly  dependent. 


I.  IsmtoDucnoN 

Consider  the  system 


|  •  hie  following  condition  holds.  Every  integral  curve  of  *  which  disconnects 
I  D  contains  a  point  p  where  /  and  g  are  linearly  dependent  and  the  first 
!  integer  /  such  that  ( odJt .  n  and  t  are  linearly  independent  at  />  is  odd 
I  Also  assumed  here  is  that  the  vector  space  dimension  of  the  Lie  ai^fbra 
j  generated  by  /,  g  and  successive  Lie  brackets  ts  2  at  x0  in  oroer  that  we 
j  know  an  open  subset  of  C  is  reachable  from  x0. 

Let  LA  be  the  Lie  algebra  generated  by  f.g,.-  •  -.g.-i  in  (I)  and 
successive  Lie  brackets,  and  suppose  that  the  vector  space  dimension  of 
La  at  x0  is  »  Then  an  open  subset  of  Af  is  reachable  from  x0  by  results  in 
(91  and  |U],  and  we  denote  by  U  the  largest  open  subset  in  M  which  is 
reachable  from  x0  called  the  region  of  reachability  from  x0.  In  [  I ).  (6).  [7), 
and  [8]  this  set  V  (which  is  shown  to  be  connected  in  (7])  is  characterized 
I  in  the  following  way.  The  boundary  of  U.3U.  consists  of  the  (it  —  I)- 
•  dimensional  integral  manifolds  of  (and  hence  of  the  Lie 

I  algebra  L'A )  that  intersect  it  Moreover,  the  integral  curve  of  /  which  suns 
at  a  point  in  3 U  must  remain  in  U,  the  closure  of  U  in  Af . 

Hence,  if  there  are  no  integral  manifolds  of  L'^  in  0  ne  if  every  integral 
.  manifold  hi  of  D  separates  0  into  two  open  disjoint  sets  0,  and  0}  with 
0,  U  0j  U  N  =  0  and  contains  points  where  the  vector  field  J  is  “in  the 
directions  of  0,,”  and  other  points  where  the  vector  field  /  is  “in  the 
direction  of  02.”  then  0  is  controllable  from  any  point  xe6  0  with  the 
vector  space  dimension  of  LA  at  x0  equal  n.  The  problem  is  to  determine 
this  turning  of  the  vector  field  /  through  the  integral  manifold  A’.  This 
must  occur  at  a  point  p  of  S  where  /  and  gt,"-.g.-i  are  linearly 
dependent,  i.e..  a  point  p  where /  is  in  the  tangent  space  to  N  at  p.  Thus, 
we  have  a  local  question  and  we  can  apply  the  beautiful  work  of  Hermes 
13]  to  study  the  turning  of  /  through  S.  The  answers  are  obtained  in  terms 
of  Lie  brackets  at  p.  We  are  using  unbounded  controls  in  our  theory,  and 
locally  this  has  the  effect  of  applying  “impulsive  controls”  in  (3). 

Section  II  of  this  paper  contains  definitions  and  known  results.  In 
Section  III  we  give  examples  and  prove  our  main  theorem. 


II.  Definitions 


We  are  concerned  with  the  controllability  of  the  system  given  in  (I). 

If  A |  and  A2  are  £®  vector  Helds  on  Af.  we  define  the  Lie  bracket  of  A, 
and  A2  by 


[A, .A 


3A, 


1-^A  -iiiA 

3x  *'  3x  ** 


m  —  l 

i(0-/(*(f»+  2  “«(')*.(*('))•  J(0)  =  x0€Af  (1) 

i«l 

where  Af  is  a  connected  real-analytic  n-dimensional  paracompact  mani¬ 
fold.  and  /.  gi.-  •  -.g.-i  arc  complete  real-analytic  vector  fields  on  Af. 
which  arc  linearly  independent  at  some  point  in  Af.  Let  0  be  a  simply 
connected  domain  in  Af  on  which  g,.-  •  are  linearly  independent 
We  are  interested  in  establishing  necessary  and  sufficient  conditions  that 
the  system  ( I )  be  controllable  on  0.  These  conditions  involve  the  compu¬ 
tation  of  the  Lie  brackets  of  /.g,.-  •  -,g,_|  at  those  points  where 
/.gi- J.-t  are  lincarfy  dependent  and  where  the  vector  space 
dimension  of  the  Lie  algebra  LA  generated  by  g,,  g2, . . . g„_ ,  and  succes¬ 
sive  Lie  brackets  is  (it  -  I). 

In  the  special  case  n  =  2.  necessary  and  sufficient  conditions  for  control¬ 
lability  are  given  in  ( 1 0]  for  the  system 

.i(r)=/<x<r))+u(,)g(x(r)).  x(0  )=x0€M.  (2) 

Lei  tade.  f  I.  the  Lie  bracket  of  ( and  c.  fud:e.  £)=j^J t.  i'll,  etc 

1  nen  the  system  (2)  is  controllable  on  0  from  any  point  x0£  0,  where  0 
is  a  simply  connected  domain  in  Af  on  which  g  is  nonzero  if  and  only  if 
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where  3A,/3x  and  3A2/3x  are  Jacobian  matrices.  For  our  purpose,  we 
could  have  taken  [A,,A:]  to  be  the  negative  of  the  above  expression  as 
well.  Of  course  it  is  possible  to  define  higher  order  Lie  brackets 
{A,, |  A,.  Aj]],  |A:.|A,,  A;]]  •  •• .  Wc  set  (adh,.  A2)=E|A1,  A2)  and  induc¬ 
tively  <a«/‘*lA,.A2)s:[A,.(a/4A,.A2)J.  Let  l.A  and  l.'A  be  the  Lie  alge¬ 
bras  generated  by  f.g„ and  successive  Lie  brackets  and 
If  i.  1  and  successive  Lie  brackets,  respectively,  where  f.g,.- 

arc  defined  in  (I).  For  x€W.  we  set  LA(x)  =  {h(x):  h€LA)  and 
■  Li(x)={A(x):  h€L'A).  The  set  of  vector  fields  g,.- is  called 
invotuUve  at  x  if  there  exist  functions  y,jt  such  that 

[.?,*/)(*)  =  2  T.,*(->r)jr*(jr)  for  all  i,j.  1  <  i.  j  «  n  - 1 .  i  -  j. 

k  »  I 

If  this  holds  for  all  x€  Af.  then  g,.-  ■  -.g„- 1  are  invo/utive  on  AT. 

By  T(M)  we  denote  the  tangent  bundle  to  Af  with  fiber  T.(M)  for 
x  €  Af .  the  tangent  space  to  Af  at  x.  If  X  is  a  (?®  vector  field  on  Af ,  then  a 
is  an  integral  curve  of  X  if  tt  is  a  £®  mapping  from  an  interval  /  C  ft  into 
Af  such  that  da(t)/dt  -  X(a<:))  for  all  i£  I.  For  S  a  subset  of  T(  Af ).  an 
integral  curve  af  S  is  a  mapping  e  from  a  real  interval  ft.  (')  into  Af  so  that 
there  exist  t  =  r0  c  /,  <  r2  <  ■■■  <it  =  t'  and  vector  fields  X„---.Xk  in  5 
with  the  restriction  of  a  to  r,]  being  an  integral  curve  of  X ,,  for  each 
i  =  I.2.-  ■  -  .A.  A  connected  submanifold  V  of  A f  is  an  integral  manifold  of 
g,.-  -.g,-,  if  Tv(  A’>  is  the  space  spanned  by  g,,-  •  at  y  for  each 

,i  €  N. 

The  subset  S  of  Af  wc  consider  is  the  one  given  by  the  vector  fields  in 
our  system  (1)  A  point  x  6  Af  is  reachable  from  x06  Af  if  there  ts  an 
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integral  curve  a  of  S  and  some  lime  T> 0  in  the  interval  for  a  such  that 
a(0) *  v„  and  ntD* x.  A  subset  A  of  W  is  rcarhuhlr  from  ,t0  if  every 
.re  .-I  i>  reachable  from  a„.  In  the  ease  A  -  M.  the  system  is  cemtroUahte 
from  Xf  If  the  system  i*  controllable  from  every  in  Vf .  then  it  is 

Wr  apply  the  above  definitions  to  our  simply  connected  domain  D  in  M 
on  which  ?..  ■  -.g.-,  ore  linearly  independent.  In  fact,  since  our  main 
results  arc  stated  for  D.  we  may  as  well  assume  that  M  ~  D.  Only  in  the 
examples  following  Theorem  3.1  in  Section  III  will  we  consider  cases 
where  D  is  properly  contained  in  M 

Letting  i  be  a  point  in  D.  »e  need  to  examine  the  possibility  of  having 
an  integral  manifold  Af  of  g,.-  •  -.g,_ ,  through y.  The  real-anaiytic  version 
of  the  famous  Frobcnius  theorem  sutes  if  gt.-  ■  ,  are  involutive  at  y 

for  equivalently  the  vector  space  dimension  of  L'A(y)  it  (*  — 11),  then 
there  is  a  unique  real-analytic  <  a  -  D-dimenaoaal  integral  manifold  S'  of 
g„-  -  through  v.  It  there  is  a  point  y'eiSnD.  then  the  vector 
l>dds  g •  ■  .g„_  |  are  involutive  at  y"  by  continuity  and  »■' €  Af  O  D,  Thus, 
an  ( n  - 1 Himenstonal  integral  manifold  of  g,.-  ••.gm-i  in  B  can  have  no 
boundary  points  in  D.  Of  course,  if  the  vector  space  dimension  of  LA(y) 
is  n.  then  them  is  no  integral  manifold  of  g of  dimension 
in  —  I)  through  y.  and  an  open  neighborhood  0  of  y  in  D  is  reachable 
from  any  point  in  0  by  Chow's  theorem  (see  (1 1]). 

We  emphasize  that  we  am  not  assuming  that  the  vector  space  dimen¬ 
sion  of  l.'A  is  a  constant  on  D.  It  can  vary  as  we  move  through  D.  but 
must  always  be  either  (n  —  I)  or  <t.  Points  where  this  dimension  is  (it  —  I) 
am  interior  points  of  integral  manifolds  of  gi."*.g..|.  We  say  that  an 
integral  manifold  N  of  g,,-  •  -.g.-,  disconnect:  0  it  there  are  disjoint, 
nonempty  open  sets  0,  and  CS  “complement  in  D  of  (0(  U'Af)  with 
O,u0;U,VaD  (Lc..  Af  separates  D  into  disjoint,  nonempty  open  sets  0, 
and  Oj).  In  view  of  the  preceding  paragraph  and  the  fact  that  0  is  pimply 
connected,  the  following  assumption  is  reasonable.  For  the  remainder  of 
this  paper,  we  assume  that  every  integral  manifold  of  ii.‘".g;.|  in  0 
disconnects  D. 

Let  V  be_an  open  subset  of  D  and  take  x€  9K.  Then  /  poaut  in  the 
direction  vf  V  (or  io*urd  V)oi  xit  there  is  an  open  neighborhood  M'otx 
in  D.  such  that  die  integral  curve  of  /  starting  at  x  and  intersected  with  W 
is  contained  in  V.  If  this  is  true  for  ail  a  6  9K,  then/patau  in  the  direction 
of  V  on  IV  We  define  /  pointing  in  ike  direction  of  V  by  replacing  V 
everywhere  with  V.  * 

Our  first  two  results  are  proved  in  [6], 

Theorem  2.1:  Assume  that  the  vector  space  dimension  of  LA  at  x0S  D 
is  n.  Let  L  be  the  largest  open  subset  of  D  which  is  reachable  from  x0 
Then  ill  consists  of  integral  manifolds  of  g,,*  •  -,g,_,  and / points  in  the 
direction  of  U  on  i(J. 

If  there  is  an  integral  manifold  N  of  g, disconnecting  D  into 
O,  and  0}  with  /  pointing  toward  0,  (or  0j )  on  Af,  then  the  system  is 
certainly  not  controllable  on  D  by  results  in  (6). 

Let  X1  denote  Hauidorff  measure  (see  J21)  in  dimetuion  d  on  D. 
Suppose  /.  is  the  set  of  points  on  which  the  Lie  algebra  VA  has  dimension 
(it  - 1).  Then  any  integral  manifold  S  of  g,.-  •  -.g,_ ,  is  contained  in  L. 
and  for  such  a  manifold  wc  must  have  A"" '( L)> 0. 

Theorem  2.2:  If  A*“'(  L)*0  then  our  system  (I)  is  controllable  from 
any  x0€  D. 

Next  we  turn  to  the  results  of  Hermes  [3)  concerning  local  conirollabil- 
i:y  along  a  reference  solution.  Let  p  be  a  point  in  D  where  f{p)m0.  f 
being  as  in  our  system  (I).  That  is.  p  is  a  critical  point  or  rest  solution 
when  all  controls  are  zero.  Suppose  that  the  vector  space  dimension  of 
L'A(p)  is  fa -I).  We  note  that  g,.---.g„_,  do  not  have  to  be  linearly 
independent  in  |3].  but  we  assume  they  ire  for  now-*nd  discuss  how  this 
assumption  can  be  removed  for  our  entire  theory  later  in  this  paper.  We 
also  remark  that  the  results  in  (3)  art  stronger  than  indicated  here. 

Let  ( • .  )  denote  the  inner  product  of  tangent  vectors  induced  by  a 
Ricmanman  metric  on  0.  By  A  we  denote  a  vector  Held  on  D  so  that 
gt.  ■  ,.  h  are  linearly  independent  at  p  Let  I  be  the  unique  vector 

such  that 

<H/»)  h(p»*  1.  </(p).g,(p)>-0.  .-I.' 

Set  r  1).  r  a  nonnegauve  integer 


0  —  I 

,•"*  2  p ■* ri  ■ "  r.-i '■ 

<»  i 

'gm-,.l.od’--g..i.(  -  ■  .(arf*'g,. /).•••  )(p), 

*'(*)-  2  77(-*« 

r 

where  i r , .  -  -  .r..,)  is  any  point  in  a  neighborhood^  the  origin  in 

The  following  theorem  con  be  deduced  from  the  results  in  (31. 

Theorem  2.3:  Consider  the  system  (I)  with/(p)*0  and  with  the  vector 
space  dimension  of  LA(p)  equal  (n  — I).  and  let  Af  be  the  integral 
manifold  of  g,..- (and  hence  L'A)  through  p.  Assume  that  there 
exists  an  integer  r>i  such  that  o,(r)m'0  for  s  in  some  ncighboriiood  of 
the  origin  in  &**',  and  let  r*  be  the  smallest  integer  with  this  property. 

1)  If  0  is  any  open  neighborhood  of  p  in  D.  let  Ar  disconnect  O  into 

two  disjoint  open  sea  0,  and  0j.  It  PAD  changes  sign  in  every 
neighborhood  of  the  origin  in  ft""',  then  there  exist  Af  CiO.  such 

that  /  points  toward  O,  at  .i>,  and  toward  0;  at  ty. 

2)  If  O  is  any  open  neighborhood  of  p  in  D.  let  Af.  0„  and  0j  be  as 
above.  Then,  if  PA*)  »*  semtdefmite.  l?ulP3s)  changing  sign  in  every 
neighborhood  of  the  origin  in  Q.m~ 1  is  a  necessary  and  sufficient  condition 
that  there  exist  y,,yj€Afn0.  such  that  /  points  toward  O,  at  .i>,  and 
toward  O-  at  yj. 

The  task  is  to  merge  Theorems  2.1.  12,  and  2.3  to  give  necessary  and 
sufficient  conditions  for  our  system  to  be  controllable  on  D 

III.  Results  and  Exampus 

We  first  prove  the  main  theorem  and  then  give  a  number  of  illustrative 
examples.  Recall  that  wc  axe  assuming  g„-  •  -.g,.,  are  lineariy  indepen¬ 
dent  or.  B.  til  ( «  —  i  l-dirncrotooal  integral  manifolds  Ot  If  ".*.-1 
disconnect  D.  and  /. g„-  •  -,g._ ,  are  linearly  independent  at  some  point 
off). 

U  tht  vector  space  dimensior.  of  VA  is  a  at  every  point  of  D.  then  the 
system  is  oontroQabie  on  D  by  Theorem  2.2  (see  [4J.  (10J).  If  the  vector 
space  dimension  of  LA  is  ( n  - 1 )  at  some  point  x  ot  D.  then  l'A(x)*  LA(x) 
and  by  Nagano’s  theorem  there  is  a  (a  —  If-dimenstonal  integral  manifold 
of/,  g,.  -  -,g._ ,  through  this  point  which  disconnecu  D.  It  is  inqftasibk 
to  move  from  one  side  ot  this  manifold  in  f>  to  the  other,  so  this  system  is 
not  controllable  by  the  comment  after  Theorem.  2.1. 

Thus,  we  asttimr  that  the  dimension  of  lA  is  a  at  every  point  of  D  and 
the  dimension  of  L'A  is  ( a  —  1)  at  some  point  in  D  (i.e..  we  have  at  least 
one  integral  manifold  of  L'A  (of  g|."  .g..i  in  Z>).  Suppose  that 
/•*!•' '  1  ore  linearly  independent  at  every  point  of  Af.  Then  we  have 

an  integral  manifold  disconnecting  D  into  two  disjoint  open  sea  0,  and 
Oi  such  that  /  points  in  the  direction  of  one  of  the  sets,  say  0„  on  N.  and 
the  system  is  not  controllable  by  the  comment  after  Theorem  II. 

Hence,  wc  additionally  assume  that  every  integral  manifold  S’  of 
«i-  "  -f.-i  i*  0  contains  a  point  p  where  /.g„- ■  -.g.-,  are  lincarty 
dependent.  Suppose  that  c,.-  -.r..,  ore  constants  such  that 

/(f)*  *i  Si\p)+  ■■■  +t.-ig.-i(p)- 

If  we  replace  u,  by  u,  -  c,.  uj  by  ii«  -  r,,-  •  •,  u,_,  by  in 

system  (l).  then  we  have  i  new  “equivalent"  system 

tit:*"  ■■■  »f..if..,))(*(')) 
m  •  I 

*  2  -.(')*,(*(')).  (3) 

I m  I 

This  new  system  Has  the  property  that  p  is  a  critical  point  (or  the  drift 
term  (i.e..  when  u,,-  ■  |  are  all  0).  Moreover.  Theorem  2.3  is  applica¬ 
ble  to  system  (3).  Since  ‘ 

[/-(fiit 
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and  is  in  the  Unger.:  bundle  to  N.g,.-  ,g,-!  being  involutive. 

Otc  Lie  brackets  [  /.  g, )  and  !/-(c,g,~ r.g,  - - c,.  ;g„. , ».  g,!  dif¬ 

fer  by  vecior  field.'  in  the  tangent  bundle  to  S'.  Thin  is  also  true  for  all 
Micce j.ive  Lie  brackets  of  interest  to  us.  We  can  define  the  functions  e>, 
and  associated  with  Theorem  2.3  for  our  system  (I)  where / is  tangent' 
to  V  at  p.  Since  we  are  interested  in  the  direction  cf  /  on  ,V  near  p.  the 
information  given  to  us  by  Theorem  2.3  applied  to  system  (3)  is  the  same 
as  that  given  by  direct  calculations  from  system  (1)  In  fact,  the  numbers 
a(  .- )  given  by  system  (1)  and  system  (31  agree  for  all  »  because  the  inner 
product  of  l(p)  with  any  tangent  vector  to  At  at  p  is  0. 

Thus,  we  have  proved  the  following  theorem  in  which  e.  and  are  as 
in  Theorem  2.3. 

Thtortm  3.1:  The  system  (1)  is  controllable  if  any  one  of  the  following 
conditions  is  satisfied. 

1 )  The  vector  space  dimension  of  L‘4  is  n  at  every  point  of  D. 

2)  Every  integral  manifold  S  of  f,.-  •  in  D  contains  a  point  p 

where  /.  g,.-  ■  -,g._ ,  are  linearly  dependent  and  0,4  r)  changes  sign  in 
every  neighborhood  of  the  origin  in  St*-1. 

-  3)  Every  integral  manifold  S'  of  g(1-  •  1  in  D  contains  a  point  p 

where  /. Ji.- •  -.g,-,  are  linearly  dependent.  is  semideHnite  on 
some  neighborhood  of  the  origin  in  ft"-1,  and  iWi)  changes  sign  in 
every  neighborhood  of  the  origin  in  6t*“  *. 

~  The  system  ( 1 )  is  not  controllable  if  any  one  of  the  following  condi  lions 
~i*  satisfied. 

4)  The  vector  space  dimension  of  LA  is  (ft  —  t)  a*,  some  point  of  D. 

5)  There  is  an  integral  manifold  S'  of  g,.-  •  -,g„_ ,  on  which 
/•Si'"  ' 'S«- 1  *re  linearly  independent 

6)  There  t$  an  integral  manifold  V  of  g|.'".g„-i  so  that  for  every 

point  pGN  at  which  /.gi. are  linearly  dependent  there  is  an 
open  neighborhood  of  the  origin  in  9."~l  on  which  4y*<r)  is  semidefinite 
and  docs  not  change  sign. 

Throughout  this  paper  we  have  assumed  that  g,.- •  -.g,.,  are  linearly 
independent  on  D.  Since  Theorem  II  (see  (7))  and  Theorem  2.3  are  valid 
if  we  replace  this  assumption  by  the  hypothesis  that  the  vector  space 
dimension  of  L4  on  D  is  (»» -  I).  Theorem  3.1  again  holds. 

W  e  present  a  scries  of  examples  which  demonstrate  the  application  oi 
Theorem  3.1  to  a  variety  of  systems  Examples  for  dimension  2  are  given 
in  (SJ. 

Example  3.1:  We  take  M  =  D  “  ft1  and 

*i  *1  To 

*1  =  /(•*('))  +  “i(»)  +*: (')  0 

j  M  l»J 

*/(x(r))  +  u,(r)g,(x(f))+«:(r)gj(x(»)) 
where /  is  any  real-analytic  vector  field  on  Q?.  Since  gj. 


[fi-ii]’ 


[gf(gi.*j]]”|0j 

•  ' 

are  linearly  independent  on  ft*.  the  vector  space  dimension  of  L\  on  ft3  is 
2.  and  part  I)  of  Theorem  3.1  implies  that  this  system  is  controllable  on 
*i\  Note  that  part  I)  of  Theorem  3.1  certainly  docs  not  require  that  g, 
and  gj  be  linearly  independent  on  D. 

-  Example  J.3:  Let  M  “  D  *  *51 3  and 


x'  +  xj  +  xf 


0  0 

•«!(')  0  1 

.1.  .0 


x -  xi  ~  xj  =0.  i.e  .  at  the  origin.  The  integral  mar-folds  of  g;  and  g; 
arc  the  hypcrplanes  x ,  “constant  Tnus.  :hcre  arc  integrals  manifolds  of 
g,  and  g.  which  do  not  contain  a  point  where /.  g,.  and  g:  are  dependent. 
By  pan  5)  of  Theorem  3.1.  this  system  is  noi  controllable  on  ft’. 
Example  3.3:  We  ukc  M  “  0  “ <?.3  and 

i,l  x|  +  xi  fol  [O' 

x2  =  ,  -  u,(r)  0  -  ajfr)  I 

1  li]  U 

=  /(x(r))x«,(/)g.(x(r))  +  «,fr)y:(x(/)). 

The  vector  fields/,  g|.  and  gs  are  linearly  dependent  when  x 5  -  xj  “0. 
Le..  on  the  x(-axis  The  integral  manifolds  of  g,  and  gj  art  the  hyper- 
planes  x,  “constant,  and  each  of  these  intersects  the  x,-axis  when  x-  =  x, 
“0.  Computing  Lie  brackets  we  find 

2x,l  [2x, 

(/.g.]=  0  •  [/•*:)=  0  • 

0  0  . 

([/.fil-*i)*[ol.  [[/■  g. ]•  fc] *  f 0  . 


K/.gj).ji}*  0  .  ll/.gj].gj]=  0 


Note  that  both  [/.g,]  and  [/,g2)  are  I  0  I  when  x-  *x}  *0.  Thus,  with 


/(p)«=  0  and  r*“2  in  our  formulas  for  p, 

loj  - 

which  is  positive  definite  in  a  neighborhood  of  (0.0)  In  By  part  6)  of 
Theorem  3.1  this  system  is  not  controllable,  since 

X  *•(*)*  *•(')• 

rm  I 

Example  J.4:  Let  M  “  D  =R3  and 

X|  [X,  1  [x,|  [x, 

X;  *  X,  +  u,(/)  X,  +V,(f)  X, 

i}  lx>J  1*1  J  lx;, 

*/(x(*))+“t(')*t(*(0)  + 

The  vector  fields /,  g,.  and  gj  are  linearly  dependent  when  x?  +  xj  +  x} 
-  3x,xjx3  “0.  and  g,  and  g}  are  linearly  dependent  when  x,  *  x;  “  Xj. 
There  is  a  point,  e.g..  x,  *0,  x2  “  -  x,  «*0,  where  /,  g|t  and  g;  are 
dependent  hut  g|  and  g]  arc  independent.  Let  0  be  any  simply  connected 
domain  in  6l3  containing  this  point  and  not  intersecting  the  line  x,  =  x}  = 
xj.  Since 


*/(x(())m  «i(r)gi(x(r))+  i»-(r)g-(  r(O) 

The  vector  field*  /,  g,.  and  g.  are  linearly  dependent  if  and  only  if 


the  dimension  of  L4  at  any  point  of  D  is  determined  by  the  span  of  /.  g,, 
and  g.  at  this  point.  Hence,  there  is  a  point  in  D  where  the  dimension  of 
L„  is  2.  and  our  system  is  not  controllable  by  part  *)  of  Theorem  3  1. 
Example  J.E  W>  take  A4  *  D  “  ‘A’  and 
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ir-i 


u 


# 

x\ -  xj 

0 

lfol 

* 

1 

0 

1 

.  1 

L  o  j 

‘ /(*(')) +  “|( Oii(  *('))  -“:(')*:(  •(••»■ 


The**  three  vector  field*  are  linearly  dependent  :f  and  only  if  (•  -  x}  =0 
X).  Computing  Lie  bracket*  we  find  that 


-2a, 

0 

and  l/.gj}m 

2x2 

0 

.  0  . 

L  0 

No*  the  integral  aunifotd*  of  g,  and  Xj  are  gjven  by  a,  *  constant  and 
coca  contain*  the  lias  i;*:i)  when  a,  it  held  fixed.  If  we  let 


«gH 


.  xt  *  I  and  Xj  •  —  1.  and  r*«  I  we  have 


We  find  that 


Letuni 


1/-X.1*  ~5 


HP)- 


"A 

x  i  [  /■  f :  i 3 


0 

4 

lOJ 


2/y/5 

l/v/5 

0 


and  r*«  1 


we  have 


MO  *(”*i)'(  “*:)*(  “20/ x2)  and  x.-OinO,. 


and  Theorem  3.1.  pan  2)  gives  us  global  coa (reliability. 

Exempt*  J.A.  la  our  preceding  example,  first-order  Lie  bracket*  were 
applied  to  imply  controllability  Here  we  u*c  third-order  bracket* 

Take  and 


‘it 

* 

'  J_ 

xi  “Xj 

1 

+  -.(0 

0 

0 

-riij(/) 

0 

,0, 

1 

.1. 

.0. 

-/(a(»))  +  u,(«)gl(a(r))-h.I(,)fc(a(r)). 

We  know  that  /.  g,.  and  g2  are  linearly  dependent  only  on  the  x,  axia  and 
the  plane  »;*- Xj.  Every  integral  manifold  of  g,  and  g2  imersaas  the  x, 
axis.  Computing  when  x2  *  x2  *0  we  find  that  the  only  Lie  brackets  of 
the  first  three  order*  that  arc  nonsero  are 


« 

0 

.0 


Letting /(g)* 


andr*»3  we  have 


which  satisfies  condition  2)  in  Theorem  3.1 -and  implies  control  lability. 

If  w*  choose  to  work  at  those  points  where  xt  *  —  x,  f*0.  instead  of 
along  the  x,-axi*.  first-order  Lie  brackets  will  suffice, 
frwplr  IT:  Let  At ■  4* - (xj-axis)  and 


V 

*» 

[01 

/>. 

m 

.  o  . 

+  «t(0 

-4*2 

.  o  , 

+«,(») 

0 

.1. 

■ /(*('))+  *|(0*|(*('))+  “:(0*i(*(0)- 

Now  g,  and  g2  are  linearly  independent  on  At  and  /.  g,.  and  X:  are 
dependent  if  and  only  if  x,  *  *2x2. 

Let 


By  Theorem  3.1.  pan  2).  our  system  is  controllable  on  C ,. 

Similar  arguments  show  that  the  system  is  controllable  on  02.  02.  an 
D,.  However,  we  can  move  from  0,  to  0-.  from  0.  to  0>.  frotr.  D-  to  D, 
and  from  0»  to  0,  by  using  the  vector  field  /.  Thus,  the  system  is  actuall 
controllable  on  .Weft1-  (xj-axis).  Note  that  the  vector  space  dimensio 
of  L4  is  I  on  the  xj-axis. 

In  our  last  example  we  were  able  to  apply  our  theory  on  0..  02.  0- 
and  04  and  make  observations  concerning  iD,.  3D..  3D,.  3Dt  ie  *?.1  i 
show  the  system  was  controllable  on  At  »  S.1  -(xj-uus)  We  remark  thi 
30,  -  (xj-axis)  consists  of  the  disjoint  union  of  two  integral  manifolds  e 
Xi  and  Xt*  neither  of  which  disconnects  At  or  intersects  the  lines  x,  * 
*2x2.  The  same  is  true  for  each  of  30*.  30,.  and  3Dt.  This  illustrau 
how  our  theory  together  with  other  techniques  can  yield  global  solution 
We  now  give  an  example  in  which  the  control  vector  is  not  constant  a 
linear. 

Exempt*  J.A-  We  let  V«R:  and 

Set  0*  *{(X|,x2)6lf  *:  a,  >0)  and  D~  *{(x,.x:)€R::  x,<0).  Obvi. 
ously.  we  can  move  from  0*  to  0“  or  0“  to  0“  using  the  vector  field/. 
Thus,  we  prove  that  the  system  is  controllable  on  0“ .  a  similar  argument 
being  applied  on  D~  to  give  us  controllability  on  RJ. 

The  vector  fields  /  and  g  are  linearly  dependent  in  D*  it  x.  *0,  and 
every  integral  curve  of  g  contains  such  a  point.  Computations  yield 

>/•■!-[  rl- 

a/.*].*i»[6xjxf]. 

in/.*].  *)•*]«[ 

If  we  let  /(g)*  ^  J.  x:  *0,  and  r**3,  we  have 


0,  *{(x,.  x,.x,)S«:  x,.x2  >0} 

02  *  {{X|,Xj.Xj)S  Af :  x2  >0  and  X|  <0} 
0j*{(*i-**i-*j)S  V:  x,.x,<0} 

0,  •  {(x,.x2.Xj)€  At-,  x,  >0  andx2  <0). 


which  satisfies  condition  2‘,  of  Theorem  3.1  to:  even,  x,  >0  We  have  s 
controllable  system. 

In  this  paper  we  have  concentrated  on  the  problem  of  controllability  for 
systems  with  {/»  —  !)  controls  oo  u  ir-dtmeasionai  manifold.  An  open 
problem  is  to  provide  a  similar  theory  if  the  number  of  control  vectors  is 
less  than  (*  - 1)  using  the  results  of  [I).  [7],  and  jS). 


Every  integral  manifold  of  g,  and  j;  in  0,  intersect*  the  line  x,  *lx;. 
and  0,  is  foliated  by  such  integral  manifolds  since 


jiM  yfffyyyCT  A  A'.1*-  y-*;'.*  '-'Trr-7'  '-7' 
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Sufficient  Conditions  for  Controllability 

LOUIS  R.  HUNT,  member,  ieee 


Abstract — Th*  problem  is  to  find  sufficient  conditions  for  the  system 

m 

i(')=/(*('))^  2  “.(')*.(*(')).  x(0)=xoeM 

I «  I 

to  be  controRable.  Here  M  hi  connected  5“  n-dimensionaf  manifold,  /, 
Ii.-  ■  -,gm  ice  complete  t?"  vector  fields  on  M.  and  .  um  are  real-val¬ 
ued  controls,  if  m  =  «  -  1.  M.f.  g,.-  ■  are  reaJ-anai\tic.  M  is  simply 

connected,  and  g,.-  ■  • .  ,  are  linearly  independent  on  M.  then  necessary 

and  sufficient  conditions  are  known.  For  the  case  of  our  (2“  system  with 
general  m.  we  assume  that  the  space  spanned  by  the  Lie  algebra  LA 
generated  by  /.  g,.-  -.g«,  and  successive  Lie  brackets  has  constant 
dimension  p  on  M  and  the  algebra  VA  generated  by  g,.-—,gm  and 
successive  Lie  brackets  has  constant  dimension  />'<  p  on  M.  If  p' -  p. 
Chow's  Theorem  implies  controllability  for  a  p -dimensional  submanifold  of 
M  containing  jr0.  If  p'<  p,  sufficient  conditions  are  found  involving  the 
computation  of  certain  Lie  brackets  at  points  where  the  vector  field  /  is 
tangent  to  the  integral  manifolds  of  L'A.  Here  we  assume  that  every  integral 
manifold  of  LA  contains  such  a  point. 

1.  Introduction 

LET  M  be  a  connected  st-dimensional  paracompact 
manifold,/,  g,,-  •  -,gm  be  complete  6“  vector  fields  on 
M ,  and  it|,‘  •  -,um  be  unbounded  (or  unlimited)  real-valued 
controls.  We  are  interested  in  finding  sufficient  conditions 
that  the  system 

*(0  =  /(*(0)+  2  “/(0*/(*(0)v  x(0 )  =  x0GM 


be  controllable.  Let  LA  be  the  Lie  algebra  generated  by  /, 
g,,-  •  -,gm  and  successive  Lie  brackets,  and  suppose  that  the 
vector  space  dimension  (i.e.,  the  space  spanned  by  the 
vector  fields  in  LA)  of  LA  on  M  is  p.  By  Chow’s  Theorem 
the  set  of  points  in  M  which  are  reachable  from  x0  are 
contained  in  a  <2*  p-dimensional  submanifold  S  of  M 
through  the  point  x0.  We  could  develop  our  controllability 
theory  for  this  submanifold  5,  but  to  conserve  notation  we 
assume  that  p  —  n  and  S  —  M. 

Thus  the  supposition  is  made  that  the  vector  space 
dimension  of  LA  on  M  is  n  at  every  point  of  M.  Hence, 
given  an  arbitrary  point  x0  in  M,  there  is  an  open  subset  of 
M  which  is  reachable  from  x0  by  results  from  [8]  and  (10]. 
In  [6]  it  is  shown  that  this  open  set  can  be  taken  so  that  its 
closure  contains  x0.  A  characterization  of  the- largest  open 
subset  of  M  which  is  reachable  from  x0,  called  the  region 
of  reachability  from  x0>  is  given  in  (1),  (6),  and  (7). 


Manuscript  received  February  9.  1981;  revised  July  7,  1981  and 
November  6.  1981.  This  work  wai  supported  in  part  by  the  Joint  Services 
Electronics  Program  under  ONR  Contract  N000I4-76-CI 136 
The  author  is  with  NASA  Ames  Research  Center.  MS  210-3,  Moffett 
Field.  CA  94035.  on  leave  from  the  Department  of  Mathematics,  Texas 
Tech  University.  Lubbock.  TX  79409. 


Under  the  assumption  that  m  =  n  —  1,  M,  /,  g,,  •  ■  •  ,g„_ , 
are  real-analytic  on  M ,  M  is  simply  connected,  and 
g,,  -  •  -  ,gB_,  are  linearly  independent  on  M ,  necessary  and 
sufficient  conditions  are  proved  in  (4)  for  our  system  to  be 
controllable  on  M.  These  conditions  depend  on  results 
found  in  [2]  and  [5].  It  is  assumed  that  the  vector  space 
dimension  of  the  Lie  algebra  L'A  generated  by  g,,-  •  ■  ,g„_ , 
and  successive  Lie  brackets  is  greater  than  or  equal  to 
(n-1). 

With  our  (2*  manifold  M  and  general  m  we  suppose  that 
the  vector  space  dimension  (i.e.,  the  spanning  dimension) 
of  L'a  at  every  point  of  M  is  the  constant  k.  If  k  =  tt, 
Chow’s  Theorem  implies  controllability  of  the  system  on 
M.  If  k  <  n,  results  from  (6]  and  [7]  mentioned  previously 
and  a  theorem  due  to  Hermes  [3]  implying  local  controlla¬ 
bility  along  a  reference  solution  are  combined  to  give 
sufficient  controllability  conditions. 

If  every  integral  manifold  N  of  L'A  in  M  contains  a  point 
x  where  /  is  tangent  to  N  (included  in  this  is  the  possibility 
that  /  vanishes  at  x),  then  our  system  is  controllable  if  the 
following  conditions  hold  for  at  least  one  such  x  in  each  N. 
There  exist  a  basis  A,.  -  ■  -  ,hk  of  LA  near  x  and  integers 
/,,•••,/*  such  that  the  space  spanned  by 

(A,(x),-  •  -,hk(x),(adf,  A,)(x),-  •  •  ,{adf,  hk)(x),- • 

(ad'<f,hl)(x),--->{aduf,hk)(x)) 

has  dimension  n.  The  vector  field  /  is  defined  in  the 
following  manner.  Let  c,,-  •  •,ck  be  constants  so  that  f  — 
c,A,  —  c2A2  —  •  •  •  —  ckhk  vanishes  at  x  and  set 

/=/-  2  c,h,. 

i- 1 

Also  (ad/,  h)  =  [/,  A],  the  Lie  bracket  of  / and  h.  (ad1/,  h) 
=  [/,[/,  A]],  etc.  If  g,,-  •  -  ,gm  are  linearly  independent  and 
involutive  on  M,  the  above  condition  is  replaced  by  the 
space  spanned  by 

(SiC*)’' ' ' '£«(■*)•  {adf >  gi)(x),  -  •  ■  ,{adf ,  gm)(x).  -  ■  • , 

{adl'f,gi)(x),---,{ad'"f%  gm)(x)} 

is  of  dimension  n.  Of  course  this  generalizes  the  known 
controllability  matrix  criterion  for  linear  time-invariant 
systems. 

II.  Definitions 

We  are  interested  in  the  controllability  properties  of  the 
system  (1). 

Let  T(M)  be  the  tangent  bundle  to  M  wuh  fiber  TX{M) 
for  x  e  M,  the  tangent  space  to  M  at  x.  For  Jfac*  vector 
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field  on  \1.  a  is  an  integral  curve  of  X  if  a  is  a  cx  mapping 
from  an  interval  I  Z  R  into  .V;  such  that  da t :  i  d:  =  A't  a( ;  n 
for  all  i  E  /  If  T  is  a  subset  of  7t  St  i.  an  integral  curve  of 
V  is  a  mapping  a  from  a  real  interval  [/.  r  ]  intc  M  so  that 
ihere  exist  t  = <:  t.  < ;;  <  •  •  •  <  t  =  t'  and  vector  fields 
.V, . •  -  -V  in  7*  with  the  restriction  of  o'  to  (r  ]  being 
an  integral  curve  cf  A'.  roi  each  /  =  1.2.  ■■  A  connected 
submanifold  .V  of  A/  is  an  tnteg'al  manifold  of  vector  fields 
A',.  •  •  •  Xk  if  TX(S )  is  the  space  spanned  by  Xk  at  y 

for  each  _y€  i V. 

Tne  subset  T  of  M  we  consider  ts  the  one  given  by  the 
vector  fields  in  our  system  (1).  A  point  xE  M  is  reachable 
from  XqE  M  if  there  is  an  integral  curve  a'  of  T  and  some 
time  i  >  0  in  the  interval  for  a’  such  that  a'(0)  =  x0  and 
a‘(r)  =  x.  A  subset  A  of  M  is  reachable  from  xc  if  every 
point  xE  A  is  reachable  from  x0.  If  an  open  subset  of  M  is 
reachable  from  x0,  the  largest  such  open  set  is  called  the 
region  of  reachability  from  xa.  If  the  system  is  controllable 
from  every  x0  in  M.  then  it  is  controllable.  Let  <p(i.  x0)  be 
the  solution  of  (l)  at  time  t  with  <p(0.  x0)  =  x0*and  which 
corresponds  to  all  controls  u,  being  0.  The  system  (1)  is 
locally  controllable  along  <p  at  time  t>0  if  all  points  in  some 
n-dimensional  neighborhood  of  g»  t,xQ)  can  be  reached  at 
time  t  by  solutions  of  (!)  initiating  from  x0. 

Let  0  be  an  open  subset  of  M  andjake  xEdO.  Then  / 
points  in  the  direction  of  O  (or  toward  O)  at  jc  if  there  is  an 
open  neighborhood  W'r  of  x  in  A/  so  that  the  integral  curve 
of  /  starting  at  x  and  intersected  with  W  is  contained  in  O, 
the  closure  of  O  in  Af.  If  this  is  tnie  for  all  xE  30.  then / 
points  in  the  direction  of  O  on  30.  We  define  /pointing  in 
the  direction  of  O  by  replacing  0  everywhere  with  0. 

If  X  and  Y  are  <2"  vector  fields  on  M.  we  define  the  Lie 
bracket  of  X  and  Y  by 

l*r)-g*-T£r 

where  dY/dx  and  dX/dx  are  Jacobian  matrices.  Higher 
Order  Lie  brackets  such  as  [  JC,(  X,  /]],  [y,(  X,  X]],  •  •  •  can 
be  defined.  We  let  (adX.  Y)  =  f  X,  Y]  and  inductively 
( adJ~'X Y)  =  [X,(ttdJX,  X)].  By  LA  and  L\  we  denote  the 
respective  Lie  algebras  generated  by  f,g \,---,gm  and 
successive  brackets  and  g,,-  ■  -,gm  and  successive  brackets 
where  /  g,.---,g„  are  defined  in  (1).  The  set  of  vector 
fields  g|, •••,£„  is  called  involutive  if  there  exist  functions 
v  4  such  that 

m 

[«,.?/](•*)=  2  y„i (x)gk(x) 

k  *  I 

for  all  i.  j.  1  <i.j<m,i  +  j. 

III.  Main  Results 

To  prove  our  controllability  results  we  combine  the  local 
controllability  along  a  trajectory  work  of  Hermes  [3]  with 
the  local  to  global  theory  found  in  [7],  The  firs;  theorem  is 
due  to  Hermes. 

Theorem  3.1.  Let  <p(t.x0 )  be  a  solution  cf  (1)  corre¬ 
sponding  to  all  u,  =  0.  A  sufficient  condition  that  the 


system  ( 1 ;  be  localh  controllable  along  at  time  ;>0  is 
that  there  exist  integers  /  .•■•./„  such  that  the  space 
spanned  by  fg;t jcc ).  •  -.gj.v0i.  iadf.g  H.x0).-  -.(adf. 
g„  -vo ad‘  f.  j , )( -t o  i. •  .{ad‘~f.  g„,)(x0)}  is  of  di¬ 

mension  n. 

If  x0  ts  a  critical  point  (or  equilibrium  point!  for  /.  then 
an  open  neighborhood  or  .t m  M  can  be  reached  b> 
trajectones  of  the  system  ( 1 )  starting  at  x0. 

Our  next  theorem  was  first  proved  in  [6]  and  [7]  and 
improved  by  Bacctotu  and  Stefam  [1).  We  assume  that  the 
vector  space  dimensions  of  lA  and  L\  are  constant  on  M. 

Theorem  3.2.  Let  U  be  the  smallest  open  subset  of  M 
containing  x3  in  its  closure  such  that  dU  contains  the  <2* 
k-dimensional  integral  manifolds  of  L'A  that  intersect  it.  If  / 
points  in  the  direction  of  U  on  3t/,  then 

U  C  { region  of  reachability  from  x0)  C  intenor  of  U. 

Conversely,  if  U  is  the  region  of  reachability  from  x0,  then 
3L’  contains  the  k-dimensional  integral  manifolds  of  L'A 
which  intersect  it,  /  points  toward  V  on  3 U,  and  V  contains 
x0  in  its  closure. 

Let  N  be  any  connected  (n  —  l)-dimensional  manifold 
(not  necessarily  differentiable)  consisting  of  the  union  of 
an  integral  manifold  S  with  ether  integral  manifolds  of  L\ 
near  Ar.  If  every  integral  manifold  N  of  L'A  in  M  contains  a 
point  x  which  has  the  following  propeny,  then  the  system 
(1)  is  controllable  by  Theorem  322.  There  is  an  open 
neighborhood  W  of  x  in  M  with  N  separating  W  into 
disjoint  nonempty  open  sets  O,  and  02  (with  N  being  the 
common  boundary  of  O,  and  Oj  in  W)  so  that  /  points 
toward  0,  at  some  x^ENC\W  and  /  points  toward  02  at 
some  x2e  N fW.  However,  if  /  has  a  critical  point  at  x,  if 
there  is  a  neighborhood  W  of  x_with  A^_  N,  0,,  and  0:  as 
above,  and  if  /  points  toward  O,  (or  0:)  at  all  points  of 
A'n  W.  then  the  system  is  not  locally  controllable  along  <p. 

In  our  next  theorem  the  vector  field  /  is  defined  in  the 
following  way.  If  x  is  a  critical  point  of  /,  then  let  f  =  /.  If 
x  is  a  point  where  /  is  nonzero  and  is  tangent  to  the  integral 
manifold  .V  of  L'A  through  x,  then  for  A,, -  •  •,hk  a  basis  for 
L'a  near  x,  there  exist  constants  c,.-  •  -,ck  so  that/-c,A,  - 
c2A2  -  ...  —  ckhk  is  0  at  x.  We  set 

/  =  /-  2 

i  =  ! 

Theorem  3.3.  Suppose  every  integral  manifold  N  of  L'A 
in  M  contains  a  point  x  where  /  is  tangent  to  A'  and  there 
exist  a  basis  hv-  ■  -  .hk  of  L'A  near  x  and  integers  /,,•  ■  -  ,lk 
such  that  the  space  spanned  by 

{h}(x)%-  ■  ■  .hk(x).(adf.  h,)(x),-  ■  ■  .(adf.  hk)(x).  . 

(ad1'/.  h,)(x).-  ■  ■  ,\  ad‘,f.  j(x'l) 

has  dimension  n.  Then  the  system  (1)  is  controllable. 

Proof:  We  examine  the  system 

A 

x(r)  =  /(x(/))-  2  u  (t)h,(xU))  (2) 


hunt:  sufficient  conditions  for  controllability 


where  the  u ,  are  unlimited  controls.  With  /  as  defined 
earlier  the  point  x  is  a  critical  point  for  the  system 

•*(')  =  /(*('))  +  2  «,(0M*(0)-  (3) 

<  =  ! 

From  Theorem  3.1  we  deduce  that  the  system  (3)  is 
locally  controllable  along  the  trajectory  with  all  u ,  =  0.  By 
the  comment  after  that  result,  an  open  neighborhood  of  x 
in  M  is  reachable  from  x.  Given  an  open  neighborhood  W 
of  x  and  a  (n  -  1 /-dimensional  manifold  N  separating  W 
into  open  sets  O,  and  02  as  before,  we  must  have  that  / 
points  toward  <9,  at  some  x, £  Nf\W  and  toward  02  at 
some  x2€  NCiW  by  a  previous  remark.  Since / and  /  differ 
by  vector  fields  in  the  tangent  bundle  of  L'A  near  x,  the 
same  is  true  for  /  itself.  From  statements  in  the  paragraph 
preceding  this  theorem,  we  have  that  our  system  (1)  is 
controllable  on  M. 

Remark.  Given  any  point  x0  in  M,  the  research  of 
Sussmann  and  JurdjCvic  [10]  and  Krener  [8]  implies  that 
there  is  an  open  set  in  M  which  is  reachable  from  x0  (a 
local  result).  Hermes  [3]  shows  that  under  the  assumptions 
of  Theorem  3.1  for  2  critical  point  x0  of  /,  this  open  set  is 
actually  an  open  neighborhood  of  x0  (again  a  local  result). 
The  important  ideas  in  Theorem  3.3  are 

(a)  that  the  points  x  where  /  is  tangent  to  the  integral 
manifolds  of  L'A  can  be  treated  by  Hermes’  theory  to  yield 
local  controllability,  and 

(b)  the  local  to  global  capabilities  of  Theorem  3.2  are 
applied  to  prove  a  result  on  global  controllability  which 
depends  on  examining  certain  Lie  brackets  at  only  those 
points  where  /  is  tangent  to  the  integral  manifolds. 

In  general,  global  theorems  are  difficult  to  prove,  and 
proceeding  from  the  local  to  the  global  is  quite  a  problem. 

We  have  the  following  corollary  to  Theorem  3.3. 

Corollary  3.4.  Let  g,,-  ■  -,g„  be  involuiive  on  M.  Sup¬ 
pose  every  integral  manifold  N  of  g,,-  ■  -  ,gm  in  M  contains 
a  point  x  where  /  is  tangent  to  fJ  and  there  exist  integers 
so  that  the  space  spanned  by  {g,U).- •  -,gm(x), 
(adf,  i X •* ).  •  ••  ( adf ,  gmX*),  •  (ad‘'f,  g,X*).--, 

(ad'~f,gm)(x)}  has  dimension  n.  Then  the  system  (1)  is 
controllable. 

As  an  application  of  our  theory  we  present  the  following 
example  suggested  by  Meyer  [9]. 

The  state  space  is  four  dimensional  but  2  axis,  so  that 
the  state  x  =  (yl.y2)  and  both  y,  and  y2  are  two  dimen¬ 
sional  with 


_  f  x,  ]  _f*3 

M*«J-  * 


The  vector  y,  is  the  position  vector  in  two  dimensions  and 
the  velocity  is  generated  by  >\  through  two-dimensional 
rotation,  given  by  the  matrix 

£_  cos  8  sin  8  j 

L  —  sin  5  cos0j 

where  the  angle  8  is  a  function  of  the  Euclidean  distance 
II _>■,!!  from  the  origin  in  R:.  The  control  u  is  2  axis  and  the 


state  equation  is 

x,  (cos0)x3  +  (sin#  )x4  To  0 

x2  (  —  sin#  )x3  +  (cos#)x*  |0  0 

x3  =  0  +w>  1  +u>  0 

.*4]  L  0  L°J  L1. 

=  f(x)+u]g  ,(x)+  l/2g2(x) 

with  M  —  R4.  Now  g,  and  g2  are  involutive  on  R*,  and  /  is 
tangent  to  the  integral  manifolds  of  g,  and  g2  if  x3  =  x4  =  0. 
Certainly  every  integral  manifold  N  of  g(  and  g2  contains 
such  a  point  and  /  =  /  at  all  such  points. 

Computing  Lie  brackets  we  find  that 

f  cos 6  ]  f  sihtfl 


(adf,g  ,)=  - 


cos  6 
—  sin  6 
0 
0 


and  {adf,g2)=- 


Thus  gt,  g2,  (adf,  g,),  and  ( adf,g2 )  are  linearly  indepen¬ 
dent  when  x3  =  xt  =  0,  and  by  Corollary  3.4  our  system  is 
controllable. 

In  the  following  example  the  control  vectors  are  not 
involutive  and  Theorem  3.3  must  be  applied. 

Let 


=  /(*('))+  2  «/(0f,(*(0) 

1  =  1 

where  M  =  Ri. 

Computing  Lie  brackets  we  find  that 

r°i  [0 

0  0 


0 

I 

0. 


implying  that  the  vector  space  dimension  of  L'A  on  M  is  3. 
An  appropriate  basis  for  UA  is 


and  A3  = 


The  vector  field  /  is  tangent  to  the  integral  manifolds  of  LA 
if  and  only  if  x4  and  x5  are  integer  multiples  of  w.  Cer¬ 
tainly  every  integral  manifold  N  of  L'A  contains  infinitely 
many  such  points. 


We  have 


[/■*:)=- 


r  cosx4 ’ 

I  0 


and  l/.  A,  ]  =  - 


0 

COSX} 

0 

0 

0  . 


Since  the  computations  with  /  and  /  yield  the  same  results 
for  this  example,  we  find  that  {h,(x),hz(. x).  A,(x). 
( adf.  /i ,  x *  U  adf.  h:)(x)}  span  a  five-dimensional  space 
when  x4  and  x.  are  integer  multiples  of  rr.  Thus  our  system 
is  controllable. 

A  much  more  difficult  problem  is  encountered  if  there 
exists  an  integral  manifold  N  of  L'A  which  does  not  contain 
a  point  x  where  /  is  tangent  to  N. 
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Abstract 

Consider  the  tine-varying  nonlinear  system  of 
m 

the  form  x(c)  -  f(x,t)  +  £  •  with 

i-i 

f.gj,  •  •  • ,  gj,  being  Vm  vector  fields  on  S19*1. 
We  give  necessary  and  sufficient  conditions  for  this 
system  to  be  transformable  to  a  rime-invariant  con¬ 
trollable  linear  system.  In  orde'  to  control  the 
nonlinear  system,  we  map  to  the  linear  system,  choose 
a  desired  control  there,  and  returr.  to  the  nonlinear 
system  by  the  inverse  of  the  transformation. 

I.  Introduction 

Suppose  we  wish  to  control  a  nonlinear  system 
of  the  form 

m 

x(t)  -  f(x,t)  «  £  UiCOgifx.t)  (1) 

i-i 

where  f,  g1(  .  .  • >  gm  are  complete  vector 

fields  on  Rn  x  R,  f  (0,0  •  0  for  all  t,  and 
Si>82>  •  •  * ■  8n  are  linearly  Independent.  If  we 
can  find  a  nonsingular  one-to-one  V  transforma¬ 
tion  that  maps  this  nonlinear  time-varying  system  to 
a  controllable  linear  time-invariant  system,  then  we 
can  use  the  control  theory  for.  the  linear  system  to 
control  the  nonlinear  system.  In  this  paper  we  con¬ 
sider  only  the  local  case,  where  the  transformation 
maps  a  neighborhood  cf  the  origin  in  Rn+pl  to  a 
neighborhood  of  the  origin  in  Kn+m  for  each  t. 

Since  every  controllable  linear  system  has  as 
invariants  the  KronecRer  indices  and  with  these  a 
3runovsky  (!]  canonical  form,  in  order  to  map  a 
nonlinear  system  to  a  controllable  linear  system  we 
assume  that  the  linear  system  is  in  Brunovsky  form. 
From  this  point  of  view  we  may  cake  an  arbitrary 
set  of  Kronecker  indices  ....  <m,  with 

‘:i  -  1  •  •  •  >  ^m’  and  use  t*ie  associated  linear 

system  as  our  target. 

♦Author  on  leave  from  Department  of  Mathematics, 
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We  indicate  the  type  of  mappings  of  interest 
to  us.  A  transformation 

T  -  (T2,T2 . Tn+m^  maps  an  opea  set  ln 

Rn'ha+l  ((x2  ,x2,  .  .  . ,  xn,  Uj,u2 . i^.t)  space) 

containing  the  origin  in  Rn+m  onto  an  open  set  in 
®  (Tj,T2,  *  •  •»  ^n,Tn+1 ,  •  *  .,  Tn+a)  space) 

containing  the  origin  in  *.n+nl  such  that  the 
following  properties  hold: 

i)  T(0,t)  -  0  for  all  t; 

ii)  T.,T2 . Tn  are  functions  of 

x2  ,x2 ,  .  7  . ,  Xjj  and  t  only  and  have  a  nonsingu¬ 
lar  Jacobian  matrix  in  some  open  neighborhood  of 
the  origin  in  Rn  for  each  fixed  t; 

iii)  Tn+l  ,Tn+2 ,  .  .  . ,  Tn+ffi  are  functions  of 
x2,x2,  .  .  .,  x„,  u2,u2,  ....  UjB.t  and  for 
fixed  (x2,x2,  ....  xn)  near  the  origin,  the 

m  »  B  Jacobian  matrix  of  Tn+1,Tn+2 . Tn^„ 

with  respect  to  Uj,u2,  .  .  . ,  1^  is  nonsingular, 
again,  for  every  t; 

iv)  Tj.Tj,  .  .  . ,  Tn  are  the  state  variables 
and  Tn+1,Tn+2,  ....  Tn+m  are  the  controls  for  a 
linear  time-invariant  system  in  the  appropriate 
Brunovsky  canonical  form;  and 

v)  for  each  fixed  t ,  T  «  (T2 ,T« ,  .  .  .,  Tn+m) 
is  a  one-to-one  map  of  an  open  neighborhood  of  the 
origin  in  (xj,x2,  ....  xn,u1,u2,  .  .  . ,  u^)  space 
onto  an  open  neighborhood  of  the  origin  in 
(Tj,T2,  ....  Tn,  Tn+1,Tn+;,  .  .  .,  Tn+m)  space. 

Her«-  “i . “m  and  Tn+m . Tn+m  can 

be  as  large  as  we  wish. 

In  other  words,  we  want  a  local  dif feomorphism 
T  (for  fixed  t)  which  maps  system  (1)  Co  our  linear 
system.  Since  our  work  will  be  in  a  neighborhood  of 
the  origin,  we  find  it  unnecessary  to  name  specific 
sets;  we  suppose  that  all  assumptions,  conditions, 
and  results  hold  in  an  open  set  in  the  appropriate 
Euclidean  space  that  contains  the  origin.  This 
theory  can  be  combined  with  the  global  inverse- 
function  theorems  as  in  [2)  to  produce  global 
results . 

If  our  system  (1)  is  autonomous,  then  the 
following  result  from  |3]  gives  necessary  and 
sufficient  conditions  to  map  to  the  Brunovsky  form. 
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(3) 


with  indices  ....  <3,.  With  a  possible 

reordering  of  the  g's,  we  can  transform  our  non¬ 
linear  autonomous  system  to  the  linear  system  with 
indices  . <m  if  and  only  if 

a)  the  sec  C  -  {g. ,[f.g. j . 

<,-i 

(ad  f ,gv) ,g..[f,gj] . 

(ad'1  ‘f.gj) . Sa.  [f-g*].  ■  •  (ad'a"lf  ,*,) } 

spans  an  n-diaenaionai  space; 

b)  the  sets  Cj  -  {g1,[f.gl],  .  . 

(ad'3  *f.g1).gJ.[f,g1].  .  •  •• 

(ad'r^.gj) . gB.Cf.ga3 . (ad'r^.ga)) 

arc  involutive  for  j  -  1,2 . a;  and 

c)  the  span  of  each  Cj  is  equal  to  the  span 
of  Cj  n  c. 

Here  C f .gi?  1*  che  Lie  bracket,  (ad'fTg^)  ■  g£; 
(ad1f,g1)  -  ff.SiJ;  (ad1f,gi)  -  [f.Cf.gJ];  etc- 

It  is  the  purpose  of  this  paper  to  generalise 
this  result  to  our  nonlinear  tine-varying  system  (1). 
We  give  an  exaatple  of  a  constructive  proof  of  Che 
transformation.  Details,  proofs,  and  a  description 
of  hov  to  apply  the  theory  in  practice  will  appear 
elsewhere. 

Historically,  the  problem  of  transforming  a 
periodic  tine-varying  linear  system  to  a  linear  tine- 
invariant  system  is  due  to  Floquec  and  Liapunov. 
Brunovsky  [1]  gave  necessary  and  sufficient  conditions 
for  a  linear  time-varying  system  to  be  "equivalent” 
to  a  controllable  time- invariant  one.  Meyer  and 
Cicolanl  in  [4]  and  [5]  present  conditions  for  their 
nonlinear  time-varying  block  triangular  systems  to 
be  transformed  to  a  linear  time-invariant  system. 

Their  block  triangular  systems  can  be  put  in  the 
fora  (1).  The  second  author  in  {6]  has  given  a  talk 
on  the  single-input  case. 

For  transformations  of  autonomous  nonlinear 
systems  to  linear  systems  we  refer  to  the  work  of 
Keener  |7],  Brockett  [8],  Jakubczyk  and  Respondek 
IS],  and  the  authors  ([2],  [4],  [6],  and  [10]). 


t.ad  f,g)  “  g 
(ad‘f,g)  »  U'.gj 
<ad2f,g)  -  [f.Cf.gll 


(adkf.g)  -  Cf.(adk_1f.g)] 

Given  vector  fields  fj^.  ■  ■  •  •  f r  °°  K°> 

we  say  that  this  set  is  involutive  if  there  exist 
if"  functions  with 

r 

[f^.fj J (x)  -  Z  Yljk(x)fk(x).  1  <  i.  J  <  r.  i  *  J. 

k»?.  ‘  a 

If  these  vector  fields  sra  involutive  and  x„eS  , 
then  there  is  u.  unique  r-dlmensional  Vm  manifold 
S  containing  x0  so  that  the  tangent  space  to  S 
at  sach  xeS  la  tha  spaca  spanned  by 

f  i  (x) , f  2  (x) . fr(x).  That  ia,  S  la  tha 

unlqua  integral  manifold  of  f;^.  .  .  .,  fr 
through  xt.  This  la,  of  course,  due  to  the  famous 
theorem  of  Frobenius. 

For  a  Vm  vector  field  f  and  a  Vm  time- 
varying  vector  field  g,  we  define 

cr'f.g)  -  s 
(I^f.g)  *  (adlf.g)  + 

(4) 


(I*f.g)  -  (r1f,(rk”1f,g)) 

If  h  is  e  V  "  function,  we  define  the  Lie  deri¬ 
vative  of  h  with  respect  to  f  as 

Ij(h)  -  <dh,f)  (5) 

vhars  (*>’)  denoces  the  duality  bscvesn  ons  forms 
and  vector  fields.  Similarly,  we  let 

L4°(h)  -  h 
Lfl(h>  -  L£(h) 

(6) 


In  section  XI,  we  give  definitions  and  prelim¬ 
inaries,  and  section  III  concaina  our  main  raaulc 
on  transforming  nonlinear  systems. 


II.  Definitions 


If  f  and  g  are  Vm  vector  fields  on  a  mani¬ 
fold  M  (F.n  in  our  theory),  the  Lie  bracket  of 
f  and  g  ia 


L£k(h)  •  L«J(Lk”l(h))  • 
For  a  W  "  one  form  w,  we  have 


where  *  denotes  the  transpose  and  3u.*/3x  and 
3f/3x  are  Jacobian  matrices.  For  this  derivative, 

we  have 

Lf#(-)  -  *• 


vnete  rg/'?x  and  3f/3x  denote  Jacobian  matrices. 
We  define 


Lf*U)  -  L«(-i) 


(8) 
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and  matrix  5  Is 


(dVfT 1  Vi)  * -i?  •  T« 


(«<Vf  +  Z  Ui8l)+^--Tn 


(v+iu4+^'T« 


III.  Partial  Differential  Equations 

Tha  problem  of  transforming  system  (1)  to 
system  (10)  reduces  to  the  study  of  a  system  of 
partial  differential  equations.  Ue  let 

Cj  ■  tj  ■  Sj  +  tj,  •  •  • i 

rB  •  ♦  Sj,  ♦  .  .  .  +  <B  •  n;  note  that  the 

equations  In  the  following  lemma  are  derived  la  [3] 
for  the  autonomous  case. 


The  system  (1)  is  transformable  to  the  linear 
system  (10)  if,  and  only  if,  we  can  solve  the 
equations 

(dTi’8i)  *  °*  1  "  l*2*  •  •  • • 

Oj  -  l,  Oj  +  l,  .  .  . , 

^“ltCj+l.  •  •  *• 

~  1#  1  #  *  *  ■* 
n  -  I  and  i  •  1 , 2 ,  *.  .  . ,  a 

+  jt  “  1-1,2  •  •  •  • 

c.  -  1 .  •  .  .f 

c.  -I,  r.*l,  .  . 


(13) 

(concluded) 


with  the  matrix 


-a-i  ”  1 •  cm-i  +  1 > 


<«<V8>>  <dV8*>  •  •  •  <«Cl.O 

(iT0,**l)  (dTOj'*l)  •  *  •  (dTOj  •  O 

•  •  e 

•  a  • 

e  •  e 

(dTOm>*l)  (dToB‘*j)  *  *  •  (dT0#*O 
being  nonaingular. 

Using  the  Leibnitz  Rule  (9)  repeatedly,  our 
necessary  and  sufficient  conditions  for  the  exis¬ 
tence  of  a  transformation  ate  finding  functions 
Tx »TC  Tg  ^  satisfying 

1  Oh-1 

(dTx.(rJf.gt))  -  0,  J  -  0,1,  ....  cl  -  2 

and  i  -  1,2,  .  .  . ,  m 

(<!Tff^+x,(r^f ,gi))  -  0,  J  -  0,1,  .  ,  .,  Cj  -  2 
and  i  »  1,2,  .  .  . ,  o 


<dT0  ^.(^f.gj))  -  0.  )  -  0.1 . «B  -  2 

®”1  and  1  ■  1,2,  .  .  .,  a 

(1! 

such  that  the  matrix 

(dTl,(r<l'lf,g1))  .  .  .  (dTl.(r,Cl*l£.gn)) 

(dVx,(r'rlf,8i)>  •  •  •  <dVl,<r<?'lf-8m>> 


<dTc  •  •  •  <dTc 

[_  m- 1  o-i 


is  nonsingular .  This  lead3  to  our  main  result. 
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Theorem  3.2. 


The  time- varying  nonlinear  system  (1)  is  trans¬ 
formable  to  the  time-invariant  linear  system  (10) 
if,  end  only  if. 


as  we  let  s2,s,,s4  *s  vary  (with  sx  fixed) 
Since  T,  «  sx  is  constant  on  each  Integral  mani¬ 
fold,  T,  does  satisfy 

(dTi.^f.g^)  -  0,  J  -  0,1  and  i  -  1,2 


a)  the  set  C  •  (gx,(rl  £«*i> . 

(r  f»gx),g2*^  (*i2)i  •  •  *» 

(r^f.gj) . fa,<rJ  f,fa) . 

(rKn-1 , g  )}  spans  an  n-dimensional  space  for 
each  t; 


b)  the  sets  Cj  *  {gx,  O^f.g,) . 

(r'ci"*f.g1).gJ,(r1f ,gj),  .  .  ., 

(rlfJ~,f,g2) . g..(rlf.gB) . 

(rKJ  *f,g  )}  are  involutive  for  each  t  and 
J  ■  1,2,  .  .  . ,  a;  and 

c)  the  span  of  each  C,  is  equal  to  the  span 
of  Cj  P.  C  again  for  everyJ  fixed  t. 

We  show  a  constructive  proof  of  a  solution  to 
(IS)  and  (16)  under  the  hypotheses  of  Theorea  3.2 
and  the  assuisptlon  that  n  -  5,  <x  •  3,  and  x2  «  2. 
We  Introduce  real  parameters  sx,  s2,  s,,  s4,  s5. 

let  x(sx)  be  the  solution  to  the  systea 

#•  -  (r*f.gx)  (17) 

satisfying  x(0)  -  (0,0,  .  .  .,0),  the  origin  in 
Hs.  Next,  we  denote  by  x(s2,s2)  the  solution  of 

4~  m  <rlt*«i)  08) 

us  2 

with  x(Sj,0)  ■  x(sx).  The  5-tuple  x(sx,s2,s,)  is 
the  solution  to 


—■  -  (rlf ,g2)  d9) 

with  x(s1,s2,0)  »  x(sl,s2).  Continuing,  we  solve 
in  order 


£ -•«»  *nd  irr*'  (20) 

to  find  a  function  x(s2 ,s2  ,s, ,s4 ,ss) .  Froa  the 
hypotheses  of  the  theorea,  we  conclude  that  the 
Jacobian  matrix  of  the  function  with  respect  to 
s1,s,,s,,sk,a$  is  nonsingular  at  the  origin.  By 
the  inverse  function  theorea,  we  can  solve  for 
s,  ,s2  ,s, ,sk ,s,  at  functions  of  xx ,x2,x, ,xk ,x,. 

We  note  chat  x  (sx ,s2  ,s,  ,sk  ,s,)  is  also  s  function 
of  c,  and  this  inversion  is  done  for  esch  fixed  c. 

Thus,  if  we  can  choose,  in  the  i, ,s2 ,s, ,sk ,s , 
space,  solutions  Tx  and  Tk  to 

<dTj,(rJf,g  ))  -  0,  j  -  0,1  and  i  -  1.2 

,  v  (21) 
<dTk,(r-lf,g1))  -  0,  J  -  0  and  i  -  1 ,2 

then  we  have  a  transformation  (for  our  choices  the 
matrix  (16)  will  be  nonsingular).  We  let  T,  ■  s , 
and  Tk  •  s,.  Since  C,  C  is  Involutive  for 
fixed  t,  we  get  an  integral  manifold  of  C,  C 


Also,  since  C2  n  C  is  involutive  with  t  fixed, 
letting  s„  and  ss  vary  (with  82,82,$,  fixed), 
we  find  an  integral  manifold  of  C2  n  C.  Our 
choice  T%  -  s,  is  constant  on  each  such  integral 
manifold  and 

<dT*-*i)  “  0  £or  1  *  1*2 

Example  3.3. 


Co naider  the  system 


— 

—  - 

r~  — 

■  - 

*1 

sin  x2 

0 

0 

** 

sin  X| 

0 

0 

*» 

- 

0 

+  u. 

1  +  t 

+  u2 

0 

x,. 

X,  +  Xks 

0 

0 

X, 

0 

0 

1 

LL 

_ 

on  U  *  V  C  F5  *  B,  where 


Computing,  we  have 


[f.8xl 


-(1  +  t)cos  X, 
0 
0 
0 


(adJf ,gx)  - 


cos  x2  cos  x, 
0 
0 
0 
0 


.  l£. 8:1 


t°j 


(23 


Thus 


(Tlf ,gx)  - 


-(1  +  t)cos  X, 


(24 


I  0 

I 

j  0 

.  (r1f ,*,)  -jo. 

i  -1  : 

L°J 

(24) 

(concluded) 

Checking,  we  find  that  the  hypotheses  of  Theorem  3.2 
are  satisfied  la  a  neighborhood  of  cho  origin. 
Solving 


(r*;. 8l) 


(1  +  c)  (co«  X}  C0«  X,) 
-2  cos  x, 

0 

0 

0 


dx 

d*a 


dx 

d*2 


<r‘f.«x) 


(r^.g,) 


dx 

*•» 


It 


(25) 


In  order,  wo  find 

sin(2s, ) 

Xj  -  (1  +  c)  . y 

Xj  -  -(1  +  c)(*la  s2)  -  2*1 
x,  •  (1  c)a%  ♦  a2 
*.  - 


Thus,  wo  hsv* 


•i 


l 

7 


arc  sin 


x,  +  arc  sin  777- 
-(1  +  t) 


x,  -  arc  sin 


(«. +  ,rc  ,in(rr?)) 
V  -a  1 5  — ! 


I  +  c 


Wo  than  daflna 

1  (  2x> 

Tx  -  j  arc  sin  ^777 

T„  -  -x^  . 

Tht  functions  T1#T,,Tf,T, ,  and  T7  can  bo  found 
iron  equations  (13). 

IV.  Conclusions 

We  have  nantionad  results  that  classified 
those  nonlinear  Cl no -varying  sys tens  like  system  (1] 
which  can  be  transformed  to  controllabls  linear 
time-variant  systems.  Ve  also  outlined  a  aethod  of 
constructing  such  a  transformation  In  che  case 
*2*3  and  Kj  -  2  on  X*. 
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Data  base  for  symbolic  network  analysis 

C.-C.  Wu,  M.S.,  and  Prof.  R.  Saeks,  M.S.,  Ph.D.,  Fel.  I.E.E.E.,  P.E. 

Indexing  terms:  Linear  networks.  Transfer  functions 

Abstract:  A  data  base  for  generating  the  symbolic  transfer  functions  f6r  a  linear  electronic  circuit  is  formu¬ 
lated  and  an  appropriate  retrieval  theorem  derived.  The  size  of  the  required  data  base  is  O(n’)  independently 
of  the  number  of  simultaneously  varying  parameters,  where  n  is  the  total  number  of  component  output 
terminals,  and  the  cost  of  retrieval  is  O(p’)  multiplications  where  p  is  the  actual  number  of  circuit  parameters 


which  vary  simultaneously  in  a  given  analysis.  As  such,  both  storage  and  computational  requirements  are 

minimised. 

List  of  symbols 

Matrix 

Type 

Dimension 

Index 

a  = 

composite  component  input  vector 

m  x  1 

— 

b 

composite  component  output  vector 

n  x  1 

— 

u  = 

composite  system  input  vector 

PX  1 

— 

y 

composite  system  output  vector 

qx  1 

— 

Lix  = 

connection  matrix 

m  x  n 

— 

Lj  i  * 

connection  matrix 

q  X  n 

— 

L\x 

qth  row  of  Lai 

1  x  n 

Q  *  1,2,-.. 

.q 

Li2  = 

connection  matrix 

m  x  p 

— 

tnh  column  of£u 

m  x  1 

*-1.2.... 

Z-u  * 

connection  matrix 

q  x  v 

— 

L% 

q-v  entry  in  Ln 

1  x  1 

Q  =  1  *  2,  •  •  • 

S 

composite  system  transfer  function  matrix 

q  x  v 

— 

S9" 

q-v  entry  in  5 

1  x  1 

?  *  1,2,... 

z 

composite  component  transfer-function  j 
matrix 

A  X  JR 

" 

Z* 

nominal  composite  component  transfer- 
function  matrix 

A  X  JR 

2. 

composite  component  transfer  function 
perturbation  matrix 

A  x  m 

— 

c* 

column  vector  characterising  pertur¬ 
bation  of  *th  parameter 
array  of  the  ck  vectors  for  the  parameters 
which  actually  vary  (row  [c*] ) 

A  X  1 

*  =  1,2,.. 

C  m 

nx  p 

— 

r*  = 

row  vector  characterising  perturbation  of 
*th  parameter 

1  X  JR 

*=1,2... 

R  = 

array  of  r*  vectors  for  the  parameters 
which  actually  vary  (col  [r  ] ) 

p  X  JR 

•  ,* 

bk 

*th  variable  parameter 

1  X  1 

*=1.2... 

& 

array  of  6*s  for  parameters  which  actually 
vary  (diag  [6*]) 

pxp 

1  Introduction 

transfer  function 

is  required.  Of  course. 

the 

achieved  via  such  an  approach  is  dependent  on 

Historically,  symbolic  network  analysis  has  been  motivated  by 
the  problems  of  circuit  design  and,  as  such,  the  emphasis  has 
been  placed  on  quickly  and  efficiently  obtaining  a  symbolic 
transfer  function  from  a  given  set  of  circuit  specifications  [2,3]. 
In  an  operational  or  maintenance  environment,  however,  one 
is  typically  given  a  prescribed  nominal  circuit  and  desires 
to  determine  the  effect  of  various  (possibly  large)  perturbations 
thereon.  This  is  the  case  in  a  power  system  where  one  is  given 
a  fixed  network  and  desires  to  determine  the  effect  Of  proposed 
modifications  thereto.  Alternatively,  in  the  problem  of 
analogue  circuit  fault  diagnosis,  one  desires  to  simulate  the 
effect  of  a  number  of  alternative  failures,  to  compare  the 
simulated  data  with  the  observed  failure  data  [4] . 

In  such  an  operational  or  maintenance  environment, 
numerous  perturbations  of  the  nominal  circuit  are  studied  and, 
as  such,  significant  computational  efficiencies  can  be  obtained 
if  one  first  generates  a  data  base  in  terms  Of  the  nominal  circuit 
parameters  and  then  extracts  the  appropriate  symbolic  transfer 
function  from  the  data  base  each  time  a  different  symbolic 
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The  author!  are  with  the  Departments  of  Electrical  Engineering,  Tenas 
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data  base  and  the  ease  with  which  a  symbolic  transfer  function 
may  be  retrieved  therefrom. 

The  obvious  manner  in  which  to  generate  such  a  data  base 
is  to  simply  precompute  the  coefficients  of  all  required 
symbolic  transfer  functions  and  store  them  in  the  data  base. 
Retrieval  from  such  a  data  base  is,  of  course,  immediate,  but 
the  data  base  may  become  overly  large.  Indeed,  the  number  of 
transfer  functions  which  must  be  stored  is  0 (**)  where  k  is  the 
total  number  of  potentially  variable  circuit  parameters  andp  is 
the  maximum  number  of  circuit  parameters  which  may  vary 
simultaneously .  An  alternative  approach  is  to  store  the  nominal 
transfer  function  information  and  then  use  Householder  s 
formula  [1]  to  compute  the  required  symbolic  transfer  func¬ 
tions.  In  such  a  data  base  we  need  only  store  0(nJ )  transfer 
functions,  where  n  is  the  total  number  of  component  output 
■terminals,  but  retrieval  requires  0 (n3+pJ)  multiplications, 
where  p  is  the  actual  number  of  circuit  parameters  which  vary 
simultaneously.  Since,  in  practice,  n^p,  the  retrieval  process 
requires  approximately  0(n5 )  multiplications  and  is  dominated 
by  the  large  dimensional  matrix  multiplication  required  by 
Householder's  formula  rather  than  the  low  dimensional  inverse. 

In  the  present  paper,  we  will  formulate  an  alternative  data 
base  for  the  svmbolic  transfer  functions  which  also  requires 
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multiplications.  Since  p  is  typically  small,  this  is  tantamount 
to  immediate  retrieval. 

In  the  rest  of  this  introduction,  we  will  review  the  properties 
of  the  component  connection  modei  for  a  large-scale  circuit  or 
system  [1,6,7]  which  serves  as  the  starting  point  for  our 
theory.  The  data  base  and  retrieval  formulas  for  the  case  where 
p  <  2  are  formulated  in  Section  2  and  the  general  retrieval 
formula  is  derived  in  Section  3.  Section  4  is  devoted  to  the 
problem  of  retrieving  sensitivity  formulas  frcm  the  data  base 
-and  Section  5  deals  with  the  problem  of  updating  the  data 
base  when  the  nominal  circuit  parameters  are  changed.  Finally, 
Section  6  is  devoted  to  seven!  examples  illustrating  the  theory. 

The  component  connection  model  is  an  algebraic  model  for 
an  interconnected  dynamical  system  which  subsumes  the 
classical  topological  models  but  is  more  readily  manipulated 
both  analytically  and  computationally  [1,6,7].  The  motiv¬ 
ation  and  justification  of  the  model  are  discussed  in  detail  in 
Reference  1  and  will  not  be  repeated  here.  The  component 
’  connection  model  takes  the  form  of  the  set  of  simultaneous 
equations: 

.  b  «  Z(J<J)a  (1) 

a  m  Lt,b  +  Lau  _  (2) 

and 

*  v  *  l2ib  +  Lau  (3) 


u*uy  vartaoie  component  parameters  though  at  most  p  such 
parameters  vary  in  any  given  analysis  ;p  <p<k  Indeed,  p  <* 
in  most  applications.  Finally,  we  note  that  Z,  can  be  expressed 
more  concisely  in  the  form 


z  * 

C&R 

(9) 

where 

C  • 

[c1 1  eJ  •  . 

00) 

V 

R  » 

.i 

t 

01) 

I'M 

and 

S' 

4-  82  .  02) 

S' 

m  m 

The  above-described  notation,  formulated  for  the  component 
connection  model,  is  summarised  in  the  list  of  symbols; 


Here,  Z(»Z(ju))  is  a  frequency-dependent  matrix  charac- 
ising  the  decoupled  system  components  with  composite 
component  input  and  output  vectors  e  and  b,  respectively.  On 
the  other  hand  the  i,j  ■*  1,2;  matrices  are  frequency- 
independent  connection  matrices  characterising  the  coupling 
between  the  composite  component  vectors  a  and  b  and  the 
composite  system  input  and  output  vectors  u  and  y,  respect- 
.ively.  . 

A  little  algebra  with  the  component  connection  equations 
will  readily  reveal  that 

S  -  Ln+Lu(\-ZLn)"ZLa  (4) 

where  5{»  S(ju))  is  the  composite  system  transfer  function 
matrix  [  1  ]  characterising  the  external  behaviour  of  the  system 
via 

y.  -  S(Ju)u  (5) 

Often,  rather  than  working  with  the  entire  S  matrix,  we  find  it 
convenient  to  work  with  its  individual  entries;  x**,  q  ■ 

1,2 . q  and  1,2, . . . ,v.  which  are  related  to  the 

component  connection  model  via 

«  L1J  +If,  (1  -ZZ.,,)*'  ZL'n  (6) 

Here  Iff  is  the  q-v  entry  in  La ;  q  m  1, 2 . q  and  v  ■ 

1, 2, ...  ,r.  If,  is  the  qth  row  of  Lu  ;q  •  1,2, ...  ,q\  and 
L\-  is  the  nh  column  ofI,j;n«*  1, 2, . . .  ,v. 

Finally,  since  we  are  interested  in  analysing  the  effects  of 
perturbing  one  or  more  components  from  their  nominal  values, 
we  decompose  Z  into  nominal  and  perturbation  terms  in  the 
form 

Z  »  Z,  +  Z,  (7) 

where 

Z.  «  £  c*oV  (8) 

*•! 

Here.  <*  {»  ck  (/<-)}  is  a  column  vector,/1*  {» /*(/w))  is  a  row 
vector,  and  6*  is  the  scalar  perturbation  for  the  in. h  potentially 
variable  component  parameter.  In  a  typical  application  one  is 
given  <* ,  r* .  and  ok:  k  »  1.2 . k,  characterising  i  poten- 


2  Data  base 


Our  data  base  is  composed  of  the  following  family  of  (fre¬ 
quency  dependent)  scalar  transfer  functions: 


-  L$  +  L\x(  1 

-Zo^.rzoiL 

q  -  1.2... 

.,q\v  -  1,2,... 

03) 

b" 

-If.O-ZoL, 

q  -  1,2,.. 

,q;f  •  i,2,... 

(14) 

dkD 

-  r*[l+Z„(l 

-ZoZ-nrZoK 

* 

12 

J 

*  -  1.2,... 

,/r,  v  m  1,2,... 

05) 

Q 

-  f*Ill(l-Z0l..r,c/ 

ktj  *  1  t  2y  • 

...k 

(16) 

Here,  q  and  v  denote  the  number  of  external  system  inputs 
and  outputs  which  are  typically  few  in  number.  As  such,  the 
e*  array,  composed  of  k*  entries,  dominates  the  data  base. 
Also  note  that  all  the  entries  in  the  data  base  are  formulated  in 
terms  of  the  nominal  component  values  and,  as  such,  the  data 
base  may  be  generated  off-line  without  a  priori  knowledge  of 
the  perturbations  to  be  analysed.  Finally,  the  entire  data  base 
may  be  generated  with  the  aid  of  only  a  single  u  by  n  (sparse) 
matrix  inverse. 

Now,  if  we  assume  that  only  a  single  parameter  is  perturbed, 

i.e. 

Z,  -  c*5V  (17) 

for  some  fixed  it  »  1,2,  ...,*,  to  retrieve  s9”  from  the  data 
base  we  must  evaluate 

r«*  -  IS +  L?1(l-[Z0+c*SV ]£„)-' 

|Ze  +cWU?a  08) 

in  terms  of  the  elements  of  our  data  base  and  ihe  variable 
parameter  Sk.  To  this  end,  we  invoke  Householder's  for¬ 
mula  [1 ) 


(W  +  XY)~l  =  Wl  -W'X(l  +  YW-'Xf'YW'' 


(19) 


© 


A:  =  -ck8k  Y  =  r*!,, 


with 

W  =  (1  -Z0In) 

obtaining 

(1-(Z0+  cW] Lnr  =  [(1  — Z0Z.n )  +  (— c*5fc)  (r^L,, )]"‘ 

=  (1  -Zol,,)-1  +(1  -Z0Lnr  ck8k{\  — r*L„(l  -Z0Lnrl  c*6  k)*‘r*Iu  (1  -Z0LnT' 


«  (1-ZoI,,)-*  + 


.  (l-ZoInr^SVlnd-ZoLn)-* 


1  -$V* 

Now,  on  substitution  of  eqn.  20  into  eqn.  18,  we  obtain 

=  aiP  +  i!,(l-[Zo+cW]Ill)-‘  [Z0  +  ck8krk]L  12 

=  LS  +  ^id-ZoL,,)-1  IZo+W]^, 

,  £?•.(!  -Z0X„)ViVl„(l  -Z0Lnyl  [Z0  +  ck8krk]L\1 

1  -5Vfc 

5*feqfcr*I„(l  -Z0LnTlZ0Lun  +  (Sk)7bqkekkrkLu[2 

1-6V* 


=  sr + s  wij,  + : 


=  jy + 


8W“  +  (5*)*  +  fr«),e,,*r*Iif2I 


1  -if 


* 


=  sy  + 


6  kbqkdkv 


1  -6"e 


Jr  „kk 


(21) 


which  is  the  desired  symbolic  transfer  function. 

If  we  assume  that  two  parameters  are  perturbed,  i.e. 


Z,  -c*«V+cW 


(22) 


a  similar  formula  can  be  obtained,  wherein  Householder’s 
formula  is  applied  twice.  Since  this  formula  is  subsumed  by 
the  general  retrieval  formula  derived  in  the  following  Section, 
we  simply  state  the  result  without  proof.  In  particular 


sQV  =  L\\  +  L\x  (1  -  [Z0  +  ck 5 V  +  cW]  Luy'  [Z0  +  ck6krk  +  cW]  L\2 

8kbqkdkv  +  5ibqidiv  +  5*6y  (—  ekkbqidjv  -  eiibqkdkv  +  ekibqidiu  +  eikbqidkv) 
1  -  Skekk  -  5Vy  +  6h8J(ekkeji  -  eV1 


=  s8"  +  ' 


•  3 


As  is  apparent  from  eqn.  23,  our  retrieval  formulas  are  quite 

bn 

b'7  . 

..  b'p 

complex,  even  for  the  case  p-2  and,  as  such,  a  more  compact 

B  = 

b7' 

b73 

..  b7p 

notation  is  required  if  they  are  to  be  tractable.  To  this  end,  we 

* 

assume  that  5* ;  Ar  =  1 , 2 , . . . ,  p  .denote  the  potentially  variable 

parameters,  and  that 

b" 

b*7  . 

. .  v* 

Z,  =  ck8krk  =  CAR  (24) 

du 

d17  . 

.  .  dlv 

Of  course,  the  same  expression  applies  to  any  set  of  p  poten¬ 

d7' 

d72  . 

.  .  d7v 

tially  variable  parameters,  given  an  appropriate  change  of  the 

D  = 

index  set.  To  obtain  the  required  symbolic  transfer  function 

for 

_<?' 

dp 7 

dpv 

S  =  Z-22  -i-Z.2l(l  (Z o  •+■  Z i  ]  £. 1 1 )  1  [Z0  +  Zi  ]  Ljj 

> 

e'7  . 

.  elp 

(25) 

with  the  above  specified  Zx ,  we  now  define  the  following 

E  » 

e7' 

e 77  . 

.  e7p 

matrices  made  up  of  elements  from  our  data  base: 

i-  “i 

fx 

ep7  . 

'  e>>P- 

So  = 


,ii 

to 


J0 


,:i 

to 


.is 

to 


to 


(20) 


(23) 


sV  fX1 


t?” 


(26) 


(27) 


(28) 


(29) 
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and  A  is  as  defined  by  eqn.  12. 

Theorem 

Using  the  above  notation,. 

5  =  L22+L2l(\-\Z0Zl)Ln)  1  [Z0  +  Z,  ]  Z.  |2 
=  S,  +  5,1  -Af)*1  A D 


Proof:  First,  we  observe  that 

Sq  ~  L^2  ~ZqLh)  ~  ZqL\2  (jO) 

is  just  the  nominal  system  transfer  function  matrix,  and 
B  =  I2,(l  -Z0LU)-'C  (31) 

and 

-  D  —  R[i  *L|j('!  —Z0Ln)  1  Z0  \  Lu 

-  *(1  -LuZ0Tl  (32) 

via  Householder's  formula.  Finally, 

E  ®*  RE ii (1  ~ ZqL ii )  'C 

where  R  and  C  are  as  defined  by  eqns.  10  and  1 1 .  As  such, 

(1— A £)-'  *  (l*Mi„(l  -ZoLu r'CT1 

=  [l+A«I1I(l-(l-ZoIl,r,CM£llrl* 

(l-ZoIuTC] 

*  [1  +  ARL||(1  ~Zoiii  ~~ZiLn)  '  C] 

*  U  +  ERLud-ZLur'C]  _  (33) 

where  we  have  invoked  Householder's  formula  with  Z  *  1 , 
JC  *  A/LZU ,  and  T  =  (I  —  Zoln)*1  C;and  eqn.9.  As  such, 

So  +  B{  1  -  A£T' AZ>  *  S0  +  L2i(1  -Z0Ln)‘l  x 

C[t  +  A/U„(l  -Zt„r*C]  x 

A*(  1  -L:lZ0T'Ll2 

-  S0  +Zu(l  -ZoZ,,)*1  (Z,  +Z,IU(1  -ZiuTZ,]  x 
(1-LUZ0)-*LIS 

*  So  +  Z.2i(l  '“^o^'n)  1  x 

{[(1  -ZZ.„ )  +  Z.Z,,  ]  (1  -Z£„ )-« }Z, (1  -L,,Z0)-‘I„ 

*  S0  +  Z.-j(l  —  Z0i|i )  '(1  —  Zo-f.ii )  x 
(1  ~ZL,t)  1  Zi ( 1  —LhZq)  *Z,i2 

*  Sc  +  In (1  —ZLu)  1 Z, (1  — iuZo)  '^iz 

*  Z,;j  +  Zj|(l  — Zoiu)  'ZoZ.ll 

+  ij,(l  -zz.,,)  'Z|(!  — Z|jZo)  'Z.u 

*  Z>22  +Z.;|Zo(l  —LnZ0)  lLn 

+  Z„(l-ZZlirZI(l-ZllZo)-*Z.I, 

*  Z.12  +  Z-u  [Zo  +  (1  -ZLn )  ' Zj  J  (1  “ZjiZg)  'Z|i 

*  Z2J  +■  Z;j (1  —  ZZ.ii )  1  ((1  ~ZLn)Zo  +  Zi  ]  x 
(1  -  ZuZ0)  1  Zn 

=  Zn  +  Z.«j  ( !  ZL  n )  ‘  [Z  ~  ZL 1 1 Z  o  1  x 

(1  — Zi(Z| )  ' Z. ii 

*  Z» ii  +  Zn ( 1  ~ ZL n)  'Z[l  -Z„Z0]  x 

(1  L\\  Z0 )  1 L  ii 

*  Z  ii  -'■ZiiO  —  ZLn)~l  ZLxi  *»  S 
as  required. 

4  Sensitivity  formulas 

If  one  is  working  directly  with  the  component  connection 
model,  it  is  well  known  [4]  that  the  sensitivity  of  S  with 


respect  to  a  parameter  6‘  can  be  computed  via  the  formula 

§r  -  Zein-zz,,)-'  (i~z.1,n-zzl,)-‘z!z.,s 

and  hence  it  is  appropriate  to  ask  whether  or  not  such  a 
sensitivity  matrix  can  also  be  computed  from  our  data  base. 
Since  the  expression 

S  =  S0  +B(\ -EE)’1  ED  (35) 

is  formally  identical  to  eqn.  4,  if  1  <  i  <p,  we  may  write 

Jp-  -  5(1  -EET'Mdl  +£(1  -AZJ'AjZ?  (36) 


*1-3 T 


with  the  one  appearing  in  the  xtn  diagonal  entry.  Clearly,  the 
expression  can  be  computed  directly  from  the  data  base  with 
the  same  level  of  computational  effort  as  required  for  the 
retrieval  formula. 

In  the  case  where  61  is  not  included  in  the  given  set  of  par¬ 
ameters  which  deviate  from  nominal,  i  >p  in  our  notation,  we 
must  first  augment  the  B,  £,  D,  and  A  matrices  to  include  6‘ 
and  then  apply  eqn.  36  to  the  augmented  system.  To  this  end, 
we  let 


bli 

b12  . 

..  bxp 

b21 

b22  . 

..  b2p 

bql 

b92 

b9P 

dn 

d12 

...  dlv 

dil 

d22  . 

...  d2¥ 

<?' 

dp2 

...  d9* 

d12 

...  <r 

e" 

e1J  .. 

.  e'p  t 

e2' 

e 22 

.  e:p  t 

e*' 

e*2  .  . 

.  epp  < 

e4' 

e°  .. 

.  eip  < 

Then  we  obtain  the  retrieval  formulas 
S  =  S0  +5‘(1  -Aa£i)-'Aa£>i 


6  Examples 

Consider  the  simple  RC  op-amp  circuit  shown  in  Fig.  1.  The 
component  connection 


—7  =  B\\  -  Aa£iY,Mp^l  (1  +  £'(1  -  AaEiylAa]Di 


S  Updating  the  data  base 

In  many  applications,  one  uses  a  data  base  such  as  that 
described  above, as  a  design  tool  to  aid  in  simulating  the  effects 
of  various  proposed  modifications  to  the  system.  When  such  a 
modification  is  finally  implemented,  it  is  then  necessary  to 
update  the  data  base  to  reflect  the  new  nominal  parameter 
values 

Z0  =  2<>  +  I  =  Z0  +  CAR  (44) 

With  the  aid  of  Householder’s  formula,  we  may  compute 
(1  —  Z0Lu)  =  [(1  ~Z0Z.n )  -  CARLXI ]  1 

=  (l-ZoL,,)*'  +(l-20£nr1  x 
c[ i  -m„(i -z0Luy'crl  x 
A/?Z,n(l  Z0LU ) 

=  (1  ZqL  u)  1  +  (1  ZqL  j  j  )  1  x 

CO-AfrARIuO-ZoI,,)''  (45) 
which,  on  substitution  into  eqn.  16,  yields 


e*  =  eki+  [ekleki  ...ekp]  (l-A£)*lA 


Similarly, 


b **  =  bqk  +  [b^b*2 . .  .bQP ]  (1  -A E)'1  A 


Fig.  1  RC  op-amp  circuit 
model  for  this  circuit  takes  the  form 
ic  sC  0  0]  vc 

vr  -  0  R  0  |  iT 

v2  0  0  K\vx 

ve'  ~0  -1  ~1  ic  1  ?,] 

ir  -  1  0  0  vr  +  0 


ti.J  |_0  -1  Ojp,J  [I 
v0  =  [0  0  1]  i~ I  J-  [0]  vt) 


Thus,  if  all  components  are  taken  to  have  nominal  values  of 
1 ,  we  obtain 


1  s  s 


(1  -Z0LU)  =  -1  1  0 


0  1  1 


1  0  -s 

(1  —  Z0Lxl)  1  =  1  1  — j 

.  -1  -1  s+ 1 


dhu  m  dku  +  [e'"e''5  . .  .ekp]  (1  -AfJ'A  •  (48) 


s  0  —  s 

(1  —Z0L u)  ’Z0  s  1  — s 


—  s  -  1  s  +  ! 


0  o-l 


^n0  Z0LU)  —  1  Or 

-  1  -  1  s 


s9v  =  siv  +  [ b"'bq 2  .  ,.bap]  (1  —  A£y'A 


As  such,  the  entries  in  our  data  base  can  be  updated  with  a 
computational  effort  which  is  commensurate  with  that  required 
by  the  retrieval  formula. 


•‘*'^-ii(l  Z0LU]'Z0  —  1 


Now.  v*. e  r~d'.  ’•? r 
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cn  a 


^Sr‘*£v"*V 


*.  s 


and  K  via  the  matricas 

#» 

j 

c'iM  »  0  8l  [1  0  0) 


c’5V 


»  • 

0 

I  «l  [0  1  0] 


c»SV 


0 

0  5*  (0 


Combining  the  appropriate  c*  and  r'  matrices  with  the  above 
expressions  as  per  eqns.  13  through  16,  we  obtain  the  data 
base 


*e  ■  1 
b'  -  -s 


b2  -  -1 


dl  -  1 

d 

o 

1 

rf*  - 

e“  -  0 

«»*  -  0 

*“  . 

*»  -  s 

e»  -  0 

**»  « 

r*--! 

e»  .-1 

«»  . 

b*  -  r+1 
rf*  -  1 


Finally,  if  we  desire  to  update  our  data  base  to  reflect  a 
new  nominal  value  for  the  circuit  parameters  of  C*  1,J?  »  1, 
and  /f*2,  we  invoke  eqns.  46  through  49  with  S1  *  l, 
yielding 

.  6*8*^  |  2 

*•  *  *•  i *  i-i  (6,) 


.  «"»V‘  (-|)f(-,)|  _ 

1-iV  °  1-S>(.)  1 

(68) 

and  rimilatiy  for  the  other  dements  of  the  dau  base. 

At  a  more  realistic  example,  consider  the  2 -stage  feedback 
amplifier  of  Fig.  2,  where,  as  before,  we  take  all  nominal 
component  parameters  to  be  one  to  simplify  the  illustration. 
The  component  connection  model  for  this  circuit  takes  the 
formr 


where  we  have  deleted  the  q  and  v  indices,  rince-we  are  dealing 
with  a  single -in  put/single-output  system. 

.  Now,  if  one  desires  to  compute  the  symbolic  transfer  func¬ 
tion  with  respect  to  perturbations  in  the  op-amp  gain,  we  have 
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0  *«, 
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0  1 


0  1  t>0 

1  0  «r, 

-I  “I  4. 


°J  V. 


'■  JM’) «7> 

Recalling  that  6*  represents  a  perturbation  from  a  nominal 
parameter  value  of  Kq  ■  1 ,  our  actud  gain  is  K  m  K<>  +  8*  • 
1  +  8*  which,  on  substitution  into  eqn.  65,  yldds 


(0  0  0  0  0  1 


0  0  0 


+  (0]  1*1  (71) 

Now,  after  invoking  the  indicated  operations,  the  entries  in 
our  data  base  for  q,/«  1,2,  and  3  take  the  form: 


1 

,  1  l _ L_ 

(i+lj*  (s+1)4  $  (s+l)> 


(72) 

(73) 


indicating  that  the  first  stage  of  the  amplifier  now  has  a  pair  of 
complex  conjugate  poles  at  —  1/2  +  V3/2,  as  is  expected  for 
this  circuit.  Of  course,  the  second  stage  is  unchanged,  retaining 
its  poles  at  s »  —  1 . 

7  Conclusions 

The  preceding  development  has  been  motivated  by  operational 
and  maintenance  considerations  rather  than  the  design  con¬ 
siderations.  In  such  an  environment,  one  typically  deals  with  a 
fixed  nominal  system,  but  carries  out  repeated  analyses 
thereon.  As  such,  the  cost  of  generating  the  required  data  base 
is  secondary  compared  to  the  cost  of  storing  the  data  base  and 
retrieving  information  therefrom.  In  these  respects,  we  believe 
that  our  data  base  in  near  optimal.  Since  the  number  of  system 
inputs  and  outputs  is  typically  small,  our  data  base  contains 
approximately  k3  elements  (actually  k3  +  k(v  +  q)  +  vq) 
where  k  is  the  total  number  of  parameters  which  are  potentially 
variable.  This  data  base,  however,  contains  sufficient  infor¬ 
mation  to  permit  one  to  retreive  symbolic  transfer  functions 
for  any  number  p<k  of  variable  parameters.  Indeed,  the 
number  of  variable  parameters  in  a  symbolic  transfer  function 
is  reflected  only  in  rise  cost  of  retrieval  which  is  of  the  order 
of  p*  multiplications  (actually  p3  +p3v\i  pv(q  %  1 )).  Since  p 
is  typically  small,  say  five  or  less,  this  is  minimal. 
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Note  that  die  fact  that  s0  *  l/(r+  l)4  implies  that  both 
stages  of  the  amplifier  induce  a  pair  of  poles  at  s  *  —  1 ,  as  is 
well  known  for  this  circuit. 
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I.  Introduction 

IN  PARTI  (1J.  a  theory  for  the  diagnosability  of  nonlinear 
memoryless  systems  (dc  circuits)  was  developed.1  The 
theory  consists  of  the  following  parts: 

1 )  a  necessary  and  sufficient  condition  for  the  local  diag¬ 
nosability  of  the  system, 

2)  simplified  tests  for  local  diagnosability, 

3)  it  theorem,  establishing  that,  for  a  locally  diagnosaWe 
system,  there  exists  a  finite  number  of  test  inputs  that  are 
sufficient  to  diagnose  the  system, 

4)  sufficient  conditions  for  single  fault  diagnosability. 
The  main  contribution  of  this  paper  is  to  develop  a  similar 
theory  for  the  diagnosability  of  nonlinear  dynamical  circuits 
and  systems.  Similar  to  [1],  we  work  with  an  input-output 
model  of  the  system.  Unlike  the  memoryless  case,  however,  the 
inputs  and  outputs  are  in  this  case,  functions  of  time  and  are 
hence  properly  considered  to  be  elements  of  an  infinite  di¬ 
mensional  Hilbert  space.  However,  since  the  parameter  space 
remains  finite  dimensional,  in  spite  of  the  finite  dimensional 
setting,  the  theory  yields  a  finite  dimensional  test  matrix.  Our 
objective  here,  therefore,  is  to  indicate  a  mechanism  whereby 
the  existing  diagnosability  theory  for  linear  systems  [2]  and 
nonlinear  memoryless  systems  [l]  can  be  extended  to  the 
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1  A  brief  review  of  the  analog  fault  diagnoeis  literature  is  available  in 
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general  nonlinear  dynamical  case.  The  sequence  in  which  we 
present  our  results  parallels  that  of  [1]. 

II.  Problem  Formulation 

For  a  nonlinear  dynamical  system,  given  an  input  waveform, 
a  set  of  initial  conditions  for  the  states  of  the  system,  and  a  set 
of  parameter  values,  the  output  waveform  is  uniquely  defined. 
For  simplicity,  we  assume  that  the  initial  states  are  fixed  at 
zero  (if  they  are  unknown  they  can  be  subsumed  into  the  pa¬ 
rameter  set),  and  that  all  measurements  are  taken  in  a  fixed 
time  interval  [0, 1  ].  Specifically,  we  consider  systems  with  p 
input  terminals,  q  output  terminals,  and  k  parameters  that  can 
be  described  by  the  equation 

y-/(u,cr)  (1) 

where,  ue  t/cP[0,l]',  the  space  of  Revalued  piecewise 
continuous  functions  oftime  on  the  interval  [0,  l],y  e  K»P 
(0, 1)*,  and  o  e  A  c  R*;  A  is  open.  Note  that  with  an  ap¬ 
propriate  inner  product,  Y  is  a  Hilbert  space  over  the  field  of 
real  numbers  [4] .  Also,  we  assume  that  f:  U  X  A  -*  Y  is  con¬ 
tinuous  in  u  and  continuously  (Frechet)  differentiable  [S]  in 
a. 

At  this  point  we  introduce  a  simple  example  (see  Fig.  1), 
which  we  will  carry  along  with  us  to  illustrate  the  various 
definitions  and  results  in  this  paper:  Note  that  the  circuit 
contains  both  linear  and  nonlinear  resistors  and  inductors,  and 
as  such,  there  exists  no  systematic  technique  to  test  the  diag¬ 
nosability  of  this  circuit.  The  voltage  u(-)  is  the  input,  while 
the  measured  output  is  the  current  y(  ).  The  various  branch 
relations  are 

f4-/o(e^-  1) 
ic  m  Ooc 
»r  *  rVr 

«V  "  Tdr 

The  set  of  parameters  is  given  by 

«»[/0^rT)r 

where  “ T ’  denotes  the  transpose.  The  input-output  model  is 
given  by  the  equation 

y(i)  *  /0(etfl*(,)  —  1 )  +  Gu(t) 
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(U  *1 


% 


•>>t .jJt)+  lr(t)j 


m*  * )  CUatJ 


Fi|.  I .  The  illustrative  example. 

+* r  Jj*  “(r)dr  +  yjj^'  «(r)<#rj2‘ 

We  now  present  some  definitions  that  are  also  available  in 
Part  I  [I],  bat  arc  relevant  to  the  results  presented  here. 

Definition  I:  The  parameter  point  a0  c  A  is  said  to  be  lo¬ 
cally  dlagnosabl e  if  there  exists  an  open  neighborhood  B  of 
ofi,  such  that  V«  c  B,  a  *  ofi.  3«  c  U,  such  that 

fiu,efi)  f*fiu,a).  a 

.  Definition  2 t  The  system  (1)  is  said  to  be  locally  diagnos- 
able  if  almost  all  (ix^  all,  except  possibly  those  in  some  dosed 
'subset  of  A  with  zero  Lcbesgue  measure)  a  a  A  are  locally 
diagnosable.  Q 

Definition  3:  Let  Af(a)  be  a  matrix  whose  elements  are 
continuous  functions  of  a  everywhere  in  A.  A  parameter  point 


\s: 

Ir 


to  the  self  duality  of  Y  and  R*./*(u,a)  is  a  linear  map  from 
Y  to  IR*  (4],  To  derive  the  adjoint  map,  we  first  define  the 
inner  product  in  the  Hilbert  space  Y  by 

Or \y)  •  x(t)Ty{t\dt  Yx,y  e  Y 

and  in  R  *,  in  the  usual  way 

(a|h)«aTh  Vo, Is  Rk. 

It  now  follows  from  the  definition  of  the  adjoint  map  [4] 
that 

/•(u,  a):y  —  ^  (u(r),  a)Ty(t)dt  Yy  e  Y. 

Thus,  Jm(u,  a)J(u ,  a)  is  a  linear  map  from  R*  to  R*.  and 
can  be  directly  identified  with  its  matrix  representation  by  the 
following  equation: 

J*(u,  a)J(u,  a)  ■  f  (u(r),  a)T^f-  (u(r),  a)dt.  (4) 
Jo  da  da 

Note  that  J*(u,  a)J(u,  a)  is  a  symmetric,  positive  semidefinite 
matrix.  Observe  that  for  our  example,  the  top  left  2X2  sub* 
matrix  of  /*(u,  a)J(u.  a)  is  given  by 


-  l)2Jt 


(*'•<»>-  l)(/0e(r)*4.<i>)dt 


J* '  (egMi,)  -  1  )(/ou(r)r*-<'>)dr 
o  • 

f  ‘  (lou{i)e^‘))2dt 

Jo 


arc  A  is  said  to  be  a  regular  point  of  M(a),  if  there  exists  an 
open  neighborhood  of  a0  in  which  M[et)  has  constant  rank. 

a 

To  define  our  test  matrix,  which  is  similar  to  the  ooc  intro¬ 
duced  in  [1],  we  let  J(u,  a)  denote  the  Frechet  derivative  of 
/with  respect  to  a,  evaluated  at  u  and  a.  With  u  c  \J  fixed, 
/maps  A  to  Y,  hence, /(u,  a)  is  a  linear  transformation2  from 
R*  to  Y.  /(«,  a)  can  be  described  by  the  following  map: 

/(u,  a):  a  —  j-^-  (u,  «)|o  Vo  «  R* 

where  -f-  (u,  a)  is  the  matrix  of  partial  derivatives  of/with 
da 

respect  to  the  components  of  a,  computed  in  the  usual  way  and 
evaluated  at  (u,  a).  For  our  example* 

&  („,  a)  -  **•<'>  -  1 1  /ou(r)r4-(0|  u(r)  | 

da  I  |  I  | 

£>>4/;  Hi- <3) 

Note  that,  in  (3),  when  ^  (u,  a)  is  multiplied  on  the  right  by 
da 

an  element  of  R5,  i.e.,  the  parameter  space,  the  result  is  a 
scalar  function  of  time,  i.e.,  an  element  of  Y.  However,  (3)  is 
not  the  matrix  representation  of  the  linear  operator  J(u,  a). 
Let  /•(«,  a)  denote  the  adjoint  map1  of  J(u,  a)  [4].  Due 


2  Intuit  ivtly,  the  Fractal  dchvauv*  is  a  uafcm  pUac  approximation  to  the 
function  at  the  point  at  which  it  is  evaluated. 

1 A  (CMrslisatiou  of  th«  concept  of  the  transpose  of  a  matrix. 


The  remaining  elements  of  the  matrix  are  similarly  defined. 
Note  that  /•(«,  a)J(u,  a)  iso  matrix-valued  function  of  u  and 
a.  To  derive  a  test  matrix  that  depends  only  on  oe,  we’integrate 
/*(«.  a)J(u,  a)  over  all  possible  inputs.  To  this  end,  let  u 
denote  a  positive  measure  defined  on  the  Bore!  sets  of  U  [6], 
such  that  u(V)  >  0,  for  every  nonnull  open  set  Kin  U.  Then 
we  define  the  test  matrix 

R(a)  *  J^/*(u, «)/(“.  <*)du(u).  (5) 

III.  Conditions  for  Local  Diagnosabiuty 

In  this  section,  we  first  present  a  theorem  that  gives  a  nec¬ 
essary  and  sufficient  condition  for  the  local  diagnosability  of 
a  parameter  point.  We  then  extend  this  theorem  to  give  a 
condition  for  the  local  diagnosability  of  the  system. 

Theorem  I:  Let  «(•)  be  any  admissible  measure  for  which 
R(a)  exists  Ya  e  A,  and  let  ofi  be  a  regular  point  of  R(a). 
Linder  these  conditions,  the  parameter  point  a°  is  locally  di- 
agnosabie  if  and  only  if  R(a°)  is  nonsingular. 

Proof— If:  By  the  integral  form  of  the  mean  value  theo¬ 
rem  [7]  we  have 

fiu,a)-fiu,aP)  -  C  ^iu,sa°  +  (1  -  s)a)ds[a  -  cfl] 
Jo  da 

Vu  6  U,  a  e  A.  (6) 

Suppose  now  that  ofi  is  not  locally  disgnosabie.  Then  there 
exists  an  infinite  sequence  of  vectors,  a‘  —  a0,  i  •  1,  •  •  %  •, 
such  that 
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/(«.  O')  "f(u,  cP)  Vu  €  U,  Vi  €  A'. 

Using  (6)  we  have,  for  ail  u  and  i 

f  ( u ,  ra°  +  (1  -  s)ai)ds[ai]  ■  0  (7) 

jo  da 

where  o'  •  [a*  —  a0]/!  (a*  —  a°}i  Since  a1  is  normalized  to 
-lie  on  the  unit  sphere  of  R  *,  which  is  a  compact  set,  a‘  admits 
a  convergent  subsequence  a‘l,  whose  limit  a  also  has  unit  norm. 
Using  the  convergent  subsequence  in  (7),  we  have 

lim  f  ^“(u,sa°+ (1 -s)a'-0dlr(«,>l 

)-•  Jo  act 

*  f  ^  (u,  cP)ds  (nj  -  (u,  a°)a  •  0. 

jo  act  act 

Since  a  has  unit  norm,  it  is  nonzero,  while 

- « w*  -  xX[£(“(,)-Hr 

X  («(/),  a°)aj</f  du(u)  *  0 


the  finite  dimensionality  of  the  parameter  space,  but  does  not 
require  that  V  and  Y  be  finite  dimensional.  Furthermore,  the 
result  is  independent  of  the  choice  of  measure  u. 

Remark  I:  In  exactly  the  same  way  as  has  been  done  for 
linear  systems  (2}  and  nonlinear  memoryless  systems  [1],  we 
can  establish  that 

*  k  —  rank  (7?(a0)} 

is  the  measure  of  solvability  of  the  parameter  point  a°,  and  that 
its  generic  value  p  is  the  measure  of  testability  of  the  system. 

□ 

The  rank  test  on  A(cr)  becomes  a  test  for  the  local  diag- 
nosabiiity  of  .the  system,  under  exactly  the  same  conditions  as 
in  the  dc  case.  This  is  immediately  obvious,  since  in  both  cases 
the  analysis  is  restricted  to  the  finite  dimensional  parameter 
space. 

Theorem  2:  Suppose  that /(u,  a)  is  analytic  with  respect  to 
a.  Then 

1 )  almost  all  a  e  A  are  regular  points  of  /?(a), 

2)  the  system  is  locally  diagnosable  if  and  only  if  generic 

rank  R(a)  •  k.  a 


implying  that  R(at°)  is  nonsingular. 

Only  If:  Conversely,  if  R(cfi)  is  singular  it  follows  from 
the  assumption  that  cfl  is  a  regular  point,  and  Lemma  1  of  [3]* 
that  there  exists  an  open  neighborhood  Vofafl  and  a  contin¬ 
uous  R*  valued  function  c(a)  r*  0  defined  on  V  such  that 

0  »  cr(a)*(a)c(a)  «  ("(0. 

X  («(/),  a)c(a)jrfr  du(u). 


fv.  Simplified  Test  for  Local  Diacnosability 
and  Existence  of  Test  Inputs 

In  the  preceding  section,  we  have  discussed  the  conditions 
under  whictnhe  test  for  the  local  diagnosabilitv  of  a  system 
reduces  to  a  rank  test  on  the  matrix  R{cP),  evaluated  at  a 
randomly  chosen  point  a°.  Similar  to  the  development  in  (1], 
our  next  step  is  to  derive  a  simple  sufficient  condition  for  the 
local  diagnosability  of  a  parameter  point. 

Proposition.  1:  Suppose  that  there  exists  m  inputs  u‘,i  »  1, 
•••.me  U  such  that  the  matrix 


Since  «(u)  >  0  and  (u,  a)  is  continuous  in  u,  this  implies 

do 

that 

da  *  0  Va  e  V,  Vu  e  U. 

Finally,  we  define  a  curve  a(s)  e  V  by  .the  differential  equa¬ 
tion 


“  ■  e(a);  a(0)  •  a®. 

Substituting  a(s)  into  /(u,  a)  and  computing  its  derivative 
with  respect  to  a  via  the  chain  rule  [7],  we  obtain 

("•  «W>  •  £“  (“.  «(*))  “  ■  («,  a(s))c(a)  •  0 


showing  that/Tu,  a(s))  is  consunt  along  a  curve  emanating 
from  cfl.  Since  a(s)  is  independent  of  u,  this  implies  that  oP 
is  not  locally  diagnosable,  thereby  completing  the  proof.  □ 
For  brevity,  in  the  above  theorem,  we  have  used  a  proof  that 
is  similar  to  the  proof  of  Theorem  1  of  [3]  and  is  shorter  than 
the  corresponding  one  in  Part  1  [1].  Note  that  the  proof  uses. 


£  /•(.»',  cP)J(u‘,  cP) 

/•i 

is  nonsingular.  Then  aP  is  locally  diagnosable. 

,  Proof:  With  ul  i  *  1 ,  •  •  •,  m  fixed,  the  function* 

col  (Au‘.a),  i  -  1,  •  •  •,«) 

is  a  map  from  A  to  Ym.  By  the  inverse  function  theorem  {7], 
and  the  definition  of  local  diagnosability,  cP  is  locally  diag¬ 
nosable,  if  the  linear  map 

col[/(i#',  a°),  /  ■  !,•••,«] 

From  R*  to  Ym,  is  injective.  This  is  true  if  and  only  if*  [7] 

Null  space  [col(7(u',  o°),  i  «  l ,  •  •  •,  m)J  »  (0j 

«•  dim  range  [row(/*(u',  a°),  i  ■  1 ,  *  *  •,  m))  »  k 

«•  dim  range  (row(/*(u',  cP),  /■  l,  ••*,«)  •  col(/(u\ 
aP),i  «  1,  k 

—  det  I  £  cP)J(u‘,  cP)  *0.  □ 

If- 1 

We  now  consider,  once  again,  the  illustrative  example. 


4  This  fact  is  ate  muMithed  in  the  initial  naps  of  the  proof  of  Lamina  1 

•f  III* 


5 eoM, B)  a  i  [A aj. 

i  ® 
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Recall  (3)  and  (4).  As  an  example  for  the  above  proposition, 
we  evaluate  Jm(u,  a)J(u .  a)  for  the  input 

u°(/)w/  Yr#(0,  l] 

and  the  parameter  values 

Note  that 

&  (u®.  efi)  -  l#*  -  1  Waiti/llt*/!) 


yt  t  ) 

's;,)f  “e^T  ‘r1 1  Jt 


ii » )  (u) 


‘it 

a 


Note  that 


We  set 


"0.75796 

1.09726 

0.50000 

0.19247 

1.09726 

U9726 

0.33333 

0.28172 

0.50000 

OJ3333 

0.33333  ‘ 

0.12500 

0.19247 

0.28172 

0.12500 

0.05000 

.0.06613 

0.09890 

0.04167 

0.01786 

Fig.  2.  Circuit  for  Example  1 . 
a  -  Uo0GCr]r. 
cfl m  [l  1  1  1  l]r 


0.06613 

0.09890 

0.04167 

0.01786 

0.00694 


j^u(r)rfT.  (9) 


Note  that  the  above  matrix  is  nonsingular.  Hence.  a°-ts  locally  and  note  that 

diagnesabie.  Further,  since  the  input-output  map  (2)  is  ana-  */-  r  ,  t  du  i 

lytic  with  respect  to  at.  the  system  is  locally  diagnosable.  ■p(u,a°)«  e“w  -  1  l«0)e"(°lu(r)  *r(f)| 

Example  I:  Consider  the  circuit  of  Fig.  2.  Note  that  the  a  l  1  1  1  . 

circuit  is  ‘‘degenerate.’'  since  we  are  applying  a  voltage  source  j  u(r)drl.  (9) 

acrassacapachor.lt  is  included  here  solely  because  it  provides  *'°  * 

a -simple  but  interesting  illustration  of  Proposition  l.  Because  To  study  the  diagnose  bility  of  this  system,  we  try  the  test 
of  the  degeneracy  of  the  circuit,  and  since  we  assumed  zero  input 

initial  conditions.  w«  restrict  our  inputs  to  continuous  function  u’(t)  ■  t  Y»  g  [0, 1]. 

of  time  that  satisfy  the  equation  ,  . 

Evaluaung  (9)  for  u  »  u\  we  have 

«(0)-0.  df 

■  .  _ _ _  .  ...  _ _  .  .  nte«UiisrV21 

For  the  araut  of  Fig.  2,  the  voltage  *(•)  is  the  input  and  the  da 

current  >(•),  the  output.  The  various  branch  relations  are  and 


/•(«>,  cfl)J{v\ efi) 


0.75796 

1.09726 

0.50000 

0.50000 

.0.19247 


(,-/«(«*'-  1) 

ic  m  Gog 

C  , 

1cm~oi 

(t  -  K4t- 

Hence,  the  input-output  model  is  given  by  the  equation 

y(t)  -  /o(e^»>  - 1 )  +  Gu(t)  +  Cu(t)  —  (t) 


1.09726 

0.50000 

0.50000 

0.19247 

1.59726 

0.33333 

0.33333 

0.28172 

0.33333 

0.33333 

0.33333 

0.12500 

0.33333 

0.33333 

0.33333 

0.12500 

0.28172 

0.12500 

0.12500 

0.05000. 

+  r£u(r)dT.  (8) 


Note  that  the  rank  of  the  above  matrix  is  4  and  in  fact,  the  third 
and  the  fourth  columns  are  linearly  dependent.  Equivalently, 

if  the  columns  of  ^(ul,efi)  are  considered  to  be  dements  of 
9a 

a  vector  space,  the  third  and  fourth  columns  are  linearly  de¬ 
pendent. 

Consider  now  the  possible  test  input 

u2{t)»t2  Vr  a  [0, 1] 

“jj"  («*,  a°)  *  l*'1  -  1  |rV,|fJj2xJ|f,/3J 


Jm(u2,  cfi)J(u2.  efi )  • 


"0.43930 

0.62862 

0.29457 

0.50023 

0.08337 

0.62862 

0.90564 

0.41755 

0.71860 

0.11977 

0.29457 

0.41755 

0.20000 

0.33333 

0.05556 

0.50023 

0.71860 

0.33333 

•0.57M3 

0.09524 

.0.08337 

0.11977 

0.05556 

0.09524 

0.01587 

.  . .  . 

A-Vv-A. 
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Notice  that,  once  again,  the  rank  is  4  and  in  this  case  it  is  the 
fourth  and  fifth  columns  that  are  linearly  dependent.  How¬ 
ever, 


ZJ*{u‘,ofi)J(u‘,ofi) 

#•1 

.is  nonsingular,  establishing  that  the  system  is  locally  diag- 
nosable.  Finally,  it  is  easy  to  verify  that  for 

u\t)  -  |J  Vi  e  {0. 1) 

the  matrix 

is  nonsingular. 

Once  it  has  been  verified  that  a  system  is  locally  diagnose bk, 
it  still  remains  to  pick  a  set  of  test  signals  |u',/ ■  l,--,  mj.and 
to  solve  the  resulting  set  of  equations 

y-yv.a)  /-I,  -,m  (10) 

for  a  6  A.  For  this  purpose  we  require  that,  for  a  representa¬ 
tive  parameter  point  ofi ,  the  linear  map  from  R  *  to  Ym, 

Hm  A  col(/(u',  ofi),  i  m  1  ,•■**,  m) 


be  injective,  so  that  a  Newton-Raphson  type  of  algorithm  will 
be  assured  to  converge  to  ofi,  from  a  sufficiently  good  initial 
guess.  More  generally,  if  the  property  of  injectiveness  is  generic 
in  a,  almost  all  faults  can  be  diagnosed  with  this  set  of  test 
inputs.  Equivalently,  since  Hm  can  be  identified  with  the 
time-varying  matrix4 


Fm  * 


(u  *,  o°)  ■  •  •  ^  (u  *,  O!0) 


(«"*.  Of0)  («*",  ofi) 

oa\  dak 


and  the  columns  of  Fm  may  be  considered  to  be  elements  of 
Ym,  we  require  that  these  column  vectors  be  linearly  inde¬ 
pendent.  For  ease  of  exposition  we  wiH  refer  to  this  condition 
as  Fm  having  column  rank  k. 

If  Proposition  1  is  used  to  test  for  local  diagnosability,  then 
the  inputs  [u1',  i  *  1 ,  •  •  %  m|  that  are  used  to  establish  local 
diagnosability,  will  also  suffice  as  test  inputs.  More  generally, 
we  have  the  following  theorem. 

Theorem  3:  Let  u  be  an  admissible  measure  for  which  R (a) 
exists,  Var  €  A ,  and  suppose  that  R(ofi)  is  nonsingular  at  a 
regular  point  ofi  e  A.  Then  3  a  sequence  u1  e  U,  i  •  1 ,  •  •  •, 
m  £  k,  for  which  Fm  has  column  rank  k.  - 

Proof:  If  (u,  ofi)  *  0  (the  zero  element  in  Y),  V«  e 
da  i 

U,  it  follows  from  the  definition  of  R(ofi)  [see  (4)  and  (5)].  that 
the  first  row  and  the  first  column  of  R{ofi)  is  zero.  This, 
however,  contradicts  the  assumption  that  R(cfi)  is  nonsingular, 
and  we  may  therefore  assume  that  there  exists  X  u1  e  U  for 

which  -^-(u1.  ofi)  *  0.  As  such,  there  exists  u1  6  V such  that 
da, 


*  a,  it  the yih  entry  in  the  vector  a. 
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F,  has  a  column  rank  of  at  least  1 .  Using  this  fact  as  a  suiting 
point  for  an  inductive  hypothesis,  we  assume  that  there  exists 
u‘,i  ■  1,  •  •  •,  n,  such  that  the  matrix  F*  has  column  rank  j  < 
k,  where  n  <  j.  We  now  desire  to  verify  the  existence  of  a 
vector  u**1  e  V  for  which  the  corresponding  matrix  F**.,  has 
column  rank  greater  than  or  equal  to;  +  1.  To  this  end  we  let 
D  be  a  nonsingular  matrix  of  scalars  which  operate  on  the 
columns  of  F,  in  such  a  way  that  the  O'  +  1  )*t  column  through 
the  Jkth  column  of  FmD  is  zero.  Since  D  is  nonsingular,  /Vh 
will  have  column  rank  greater  than,  or  equal  to,;  +  1  for  some 
u”*'  if  and  only  if  F*+\D  has  column  rank  greater  than,  or 
equal  to,;'  +  1.  Because  of  the  special  form  of  FmD,  however, 
this  will  be  the  case,  if  and  only  if,  the  bottom  row  of  Fn+ , D 
given  by 

(u«+»,  ofi)D 
da 

is  nonzero  in  columns  j  +  1  through  k,  for  some  u“+1.  If  this 
is  not  the  case,  we  may  let  d  denote  the  (J  +  1  )st  column  of  D 
which  is  nonzero  since  D  is  nonsingular,  in  which  case,  we 
have 

(u,  ofi)d  “0  6  U. 

da 

This,  however,  implies  that 

x  ( u(t ),  a°) dldt  du(u)  -  0 

[da  J 

which  contradicts  the  assumption  that  R(afi)  is  nonsingular. 
As  such,  there  must  exist  u^. ,  for  which  Hm+ 1  has  column  rank 
greater  than,  or  equal  to;  +  1.  Repeating  the  argument  in¬ 
ductively  until  an  Hm  with  column  rank  k  is  obtained,  now 
completes  the  proof  of  the  theorem.  Note  that  since  n  Sj  at 
each  step,  m  S  k. 

Remark  2:  Recall  from  our  examples,  that  unlike  the  linear 
dynamical  [2]  and  the  nonlinear  memoryless  [1]  cases,  a  sin¬ 
gle-input,  single-output  nonlinear  dynamical  system  can  be 
tested  with  fewer  test  input  signals  than  the  number  of  pa¬ 
rameters.  □ 

Note  finally,  that  the  results  of  [1]  on  single  fault  diag¬ 
nosability  can  also  be  extended  to  the  dynamical  case.  This  is 
extremely  straightforward,  since  most  of  the  analysis  is  in  the 
parameter  space,  which  is  finite  dimensional. 

v.  Conclusions 

Our  purpose  in  this  paper  has  been  to  indicate  a  mechanism 
whereby  the  existing  diagnosability  theory  for  linear  systems 
[2}  and  memoryiess  nonlinear  systems  [1]  can  be  extended  to 
the  general  nonlinear  dynamical  case.  To  this  end  we  have 
presented  a  necessary  and  sufficient  condition  for  the  local 
diagnosability  of  nonlinear  dynamical  systems  described  by 
the  explicit  input-output  model 

y-f(u,a)  (11) 

where  u  and  y  are  elements  of  infinite  dimensional  Hilbert 
spaces,  while  a  belongs  to  A  c  IR4.  On  the  basis  of  this  con- 
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dilion,  we  h»ve  shown  thkt  for  a  diagnosaMe  system  there  exists 
a  finite  number  of  test  inputs  that  are  sufficient  to  diagnose 
the  system. 

However,  in  the  case  of  circuits  and  systems  of  any  rea¬ 
sons  bie  size,  the  explicit  relationship  (1 1)  is  difficult  to  obtain, 
and  usually  only  the  implicit  equation 

),  x(f),  y(0.  “(*). «)  “  0  Vf*0  (12) 

is  available  f8].  In  the  above  equation,  x(t)  represents  a  set  of 
internal  variables  and  V*  at  Q,x(0«  Rl,y(t)s  and  u(t) 
f  Rf.  It  follows  immediately  from  the  fact  that 

±0)  -  Dx(t) 

where  D  is  the  differentiation  operator,  that  (12)  is  equivalent 
~  to  the  operator  relation 

0(x,y,u,a)  *0  (13) 

“  where,  now,  x,  /,  and  u  are  elements  of  infinite  dimensional 
Hilbert  spaces.  The  fact  that  only  the  relation  (11),  that  im¬ 
plicitly  defines  (1 1).  is  available,  poses  no  theoretical  problems. 

.  It  is  obvhms  from  our  derivation  in  the  previous  sections,  that 
we  do  not  require  that  the  symbolic  form  of  the  input-output 
model  be  known.  We  only  require  that  given  (fl,&)  e  UX.A, 
/(fi.  dr),  and  J(u,  a),  the  Frechet  derivative  of / with  respect 
to  a  evaluated  at  (6,  dr),  can  be  determined.  A  solution  of  (13) 
or  equivalently  (12)  with  (u,  a)  -  (E,  dr)  will  give  m/[6,  dr), 
while  an  application  of  the  implicit  function  theorem  [7]  on 

*  (13)  gives  us  /(£,  dr).  In  using  the  implicit  function  theorem 
however,  we  need  to  find  the  inverse  of  the  Frechet  derivative 
of  G  with  respect  to  x  and  y .  This  is  not  a  computationally  easy 
task.  A  similar  problem  arises  if  we  attempt  to  implement  a 

*  diagnosis  algorithm  that  solves  (10)  directly  using  a  New¬ 

ton-like  technique.  From  a  computational  point  of  view 
therefore,  we  find  that  it  is  better  to  work  with  (12).  In  fact  by 
studying  a  discrete-time  version  of  (12),  with  the  variable 

i(i)  in  (12)  replaced  by  a  numerical  backward  differentiation 
formula,  as  is  done  in  a  circuit  sunuiator.  it  is  possible  to  derive 
a  sufficient  condition  for  the  diagnosability  of  a  large  class  of 
dynamical  systems,  that  is  easily  onmputed,  and  leads  directly 
to  an  impletnen table  diagnosis  algorithm. 


U!  V.  Visvanathan  and  A.  Sangtovanni-Vincentelli.  "Diagnosabiliry  of 
nonlinear  circuits  and  systems— Pan  1:  The  dc  cate."  this  issue.'  pp. 
M  9-891. 

[2]  N.  Sea  and  R.  Sicks,  “Fault  diagnosis  for  linear  systems  via  multifre- 
qtiency  measurements."  IEEE  Trans.  Circuits  Syst..  voi.  CAS- 26.  pp. 
457-465,  iuiy  1979. 

[3]  V.  Visvanathan  and  A.  SangtovantH-Vincemeili.  “Fault  diagnosis  of 
DOnbaear  memoryiew  systems."  in  Free.  IEEE  Ira.  Syntp.  Circuits  Syst., 
Houston.  TX,  1980.  pp.  1062-1086. 

[4]  A.  E  Taylor  aad  D.  G  Lay,  Introduction  to  Functional  Analysis,  2nd 
ad.  New  York:  Wiley.  1980. 

[5]  J.  T.  Schwarts.  Nonlinear  Functional  Analysis.  New  Yoric  Gorton  and 
Breach.  1969. 

[6]  P.  R.  Halmot,  Measure  Theory.  New  York:  Spnapr.  1974. 

[7]  J.  Piwdwirt .  Foundations  of  Modern  Analysis.  Now  York:  Academic. 
I960. 

[8]  G.  D.  Hacfasd,  E  K.  Brayua,  and  F.  G:  Gattevtoa,  "The  sparse  tabiaM 
approach  to  network  analytic  and  dmp,"  IEEE  Trans.  Qradt  Theory, 
voi.CT-l8.pp.  101-113. 1971. 


Richard  Saaks  (S,S9-M’65-SMrJ*-F77)  was 
bora  in  Chicago.  IL.  in  1941.  He  received  the  BE. 
degree  in  1964.  the  MS.  depot  ie  1965.  and  the 
PhJ>.  depee  in  1967  from  North* tarn  (Jaivar- 
sity,  Evaasea.  IL.  Colorado  State  University. 
Fort  Celliaa,  aad  Cornell  Uaiversity,  Ithaca,  NY, 
rwpmtivaly.  all  m  alertrical  aapaaarieg. 

He  is  pieaeetly  Pael  Whitfield  Hera  Prifwiar 
at  Owvrieal  Bapaaariag  aad  Mathemetiei  at 
Texas  Tech  Uafcenky.  Lahhocfc.  where  he  it  ia- 


feeh  taalywe.  circuit  theory,  aad  methemetioil  ryew  thaery. 

Pr.  Saaks  is  a  member  of  the  Americaa  Mathamatical  Society,  the  Society 
for  ledeeinel  ead  Applied  Mathmrirs.  eed  Sipae  XL 


A fearse  Sengjavaxti  VtecHU  (M*74).  for  a  biopaphy.  see  this  isroa.  pap 
198. 


V.VtoroMrtrte 

898. 


(STS),  for  e  photograph  end  biopaphy,  aae  tide  usee,  pap 


Analog  Fault  Diagnosis  with  Failure  Bounds 

CHWAN-CHIA  WU,  KAZUO  NAKAJIMA,  member,  IEEE,  CHIN-LONG  WEY,  and 

RICHARD  SaEKS,  fellow,  ieee 


Abstract —A  riuwilHon-Mttr-twl  Wgorirtun  (or  dw  analog  (auk  diagno¬ 
sis  pwMraa  is  propaaad  in  which  a  bound  on  Ike  txi—  number  of 
riialiaainai  (aflnrar  is  wad  to  minima*  tfca  number  of  test  points 
mailed.  The  rawltaw  nlgethhui  It  appHeaMi  to  bolh  Know  and  nonlintar 
systems  with  endtiplr  lord  or  soft  faidts  and  can  be  used  to  isolate  failures 
up  to  an  arbitrarily  specified  “replaceable  dip  or  subsystem." 

I.  Introduction 

ONCEPTUALLY,  analog  fault  diagnosis  algorithms 
can  be  subdivided  into  three  classes  [4]:  Simula  tion-be- 

-  fore-test,  simulation-after-test  with  a  single  test  vector,  and 

—  simulation-after-test  with  multiple  test  vectors.  The  former 
is  commonly  employed  in  digital  testing  and  Is  char¬ 
acterized  by  minimal  on-line  computational  requirements. 
Unfortunately,  the  high  cost  of  analog  circuit  simulation 
coupled  with  the  large  number  of  potential  fault  modes 
which  must  be  simulated  in  an  analog  circuit  limits  the 
applicability  of  simulation-before-test  algorithms  in  an 
analog  test  environment.  As  an  alternative  to  simulation- 
before-test,  a  number  of  researchers  have  proposed  simula¬ 
tion-after-test  algorithms,  in  which  the  internal  system 
variables  or  component  parameters  are  computed  from  the 

-test  data  via  a  “nonlinear  equation  solver-like”  algorithm. 
Indeed,  in  the  case  where  sufficiently  many  test  points  are 
.available  only  a  single  test  vector  is  required  mid  the  fault 
diagnosis  problem  reduces  to  the  solution  of  a  linear 
equation  [12].  [15].  Except  for  the  large  number  of  test 
points  required,  this  approach  is  ideally  suited  to  the 
analog  fault  diagnosis  problem  and,  as  such,  a  considerable 
research  effort  has  been  directed  towards  the  problem  of 
reducing  its  test  point  requirements  [4],  [10],  [13],  [14],  [16], 
One  such  approach  uses  multiple  test  vectors  to  increase 
the  number  of  equations  obtained  from  a  given  set  of  test 
points  [3],  [7].  Unfortunately,  this  is  achieved  at  the  cost  of 
greatly  complicating  the  set  of  simultaneous  equations 


i 

Fig.  1.  Test  algorithm  for  abstract  circuit  or  system. 

several  hundred  components  it  is  reasonable  to  assume  that 
at  most  two  or  three  have  failed  simultaneously.  As  such, 
rather  than  solving  a  set  of  simultaneous  equations  in 
/t-space  the  solution  to  our  fault  diagnosis  problem  actually 
lies  in  a  two-  or  three-dimensional  submanifold  which 
should  yield,  a  commensurate  reduction  in  lest  point  re¬ 
quirements.  Unfortunately,  even  though  we  may  assume 
that  at  most  two  or  three  components  have  failed  we  do 
not  know  which  two  or  three,  and  as  such,  some  type  of 
search  is  still  required.  Fortunately,  with  the  aid  of  an 
appropriate  decision  algorithm  the  required  search  can  be 
implemented  quite  simply. 

Although  the  resultant  algorithm  represents  a  divergence 
from  previous  work  by  the  authors  and  the  established 
literature  [4]  we  believe  that  it  meets  all  of  the  requirements 
for  a  viable  analog  fault  diagnosis  algorithm  as  outlined  by 
Plice  [8]  and  in  [1 1].  In  particular  the  algorithm: 

i)  is  applicable  to  both  linear  and  nonlinear  systems 
modeled  in  either  the  time  or  frequency  domain, 

ii)  can  be  used  to  locate  multiple  hard  or  soft  faults, 

iii)  and  is  designed  to  locate  failures  in  “replaceable 
modules”  such  as  an  IC  chip,  PC  board,  or  subsystem 


which  must  be  solved  and,  as  such,  the  applicability  of  the 
approach  is  limited. 

The  purpose  of  the  present  paper  is  to  describe  an 
alternative  simulation-after-test  algorithm  in  which  a  bound 
on  the  maximum  number  of  simultaneous  failures  is  used 
to  reduce  the  test  point  requirements  while  still  retaining 
the  computational  simplicity  inherent  in  a  single  test  vector 
algorithm.  Indeed,  even  though  a  given  circuit  mrv  contain 
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rather  than  in  discrete  components. 

Moreover,  this  is  achieved  at  an  acceptable  computational 
cost  and  with  minimal  test  point  requirements. 

Consider  the  circuit  or  system  which  is  illustrated  ab¬ 
stractly  in  Fig.  1.  Here,  the  individual  chips,  discrete  com¬ 
ponents  or  subsystems  are  denoted  by  circles  indexed  from 
a  to  n.  These  components  are  subdivided  into  two  groups, 
at  each  step  of  the  test  algorithm,  as  indicated  by  the 
dashed  lines  in  Fig.  1.  At  each  step  we  assume  that  one 
group;  say.  d  through  n;  is  composed  of  good  components 
and  we  use  the  known  characteristics  of  these  components 
together  with  the  test  data  to  determine  whether  or  not  the 
remaining  components;  a,  b .  and  c  in  this  case:  are  good. 
Of  course,  if  components  d  through  n  ire  actually  good 


then  the  resultant  test  results  for  components  a,  b,  and  c 
will  be  reliable.  On  the  other  hand,  if  any  one  of  the 
components  d  through  n  is  faulty  the  test  data  on  a ,  b.  and 
e  wtli  be  unreliable.  As  such,  we  repeat  the  process  at  the 
next  step  of  the  test  algorithm  with  a  different  subdivision 
of  components.  For  instance,  we  may  assume  that  a  through 
d  and  h  through  n  are  good  and  use  their  characteristics  to 
test  components,  e,  /.  and  e.  Finally,  after  a  number  of 
such  repetitions  the  test  results  obtained  at  the  various 
steps  are  analyzed  to  determine  the  faulty  components. 

Of  course,  the  number  of  components  which  may  be 
tested  at  any  one  step  is  dependent  on  the  number  of  test 
points  available  while  the  number  of  steps  required  is 
determined  by  the  number  of  components  which  may  be 
^  tested  at  any  one  step  and  the  bound  on  the  maximum 
number  of  simultaneous  failures.  As  such,  the  procedure 
yields  a  natural  set  of  tradeoffs  between  the  numbers  of 
_  test  points,  simultaneous  failures,  and  steps  required  by  the 
__  algorithm.  Indeed,  since  the  compuational  cost  associated 
with  each  step  of  the  algorithm  is  essentially  the  cost  of  a 
single  system  simulation  the  latter  parameter  is  a  natural 
measure  of  computational  cost. 

In  the  following  section  we  describe  the  simulation  model 
used  to  test  one  set  of  components  under  the  assumption 
that  the  remaining  components  are  good.  The  model  is 
formulated  in  both  the  linear  and  nonlinear  cases  and  can 
be  used  as  readily  to  test  IC  chips  and  subsystems  as 
individual  components.  Moreover,  the  requirement  that  an 
appropriate  matrix  be  invertible  determines  the  maximum 
number  of  components  which  can  be  simultaneously  tested 
from  a  given  set  of  test  points  as  well  as  the  allowable 
.  component  subdivisions.  In  Section  III  two  decision  algo* 
rithms  for  analyzing  the  resultant  test  data  are  described. 
Indeed  the  required  theory  is  reminiscent,  though  not 
identical  to,  the  r-diagnosibility  theory  developed  for  dig* 
ital  system  testing  over  the  past  decade  [1J,  [2],  (9].  In  the 
context  of  our  application  we  give  an  exact  decision  algo* 
rithm  for  die  case  of  a  single  failure  together  with  an 
analysis  of  the  possible  tradeoffs  between  test  points  and 
algorithm  steps  (read  computer  costs).  Although  an  exam 
decision  algorithm  for  the  multifailure  ease  has  yet  to  be 
developed  an  heuristic  algorithm  which  is  applicable  to 
both  the  single  and  multifailure  case  is  presented.  The 
algorithm,  which  is  based  on  an  inherently  analog  heuristic 
[6],  to  the  effect  that  two  analog  errors  will  never  cancel, 
has  proven  to  be  highly  reliable  while  simultaneously  re¬ 
ducing  the  number  of  steps  required  from  that  of  the  exact 
(single  failure)  algorithm.  Finally,  Section  IV  is  devoted  to 
a  number  of  examples  including  three  nonlinear  circuits 
and  five  linear  circuits  with  as  many  as  200  components. 
'  Using  these  eight  circuits  some  400-plus  simulations  of  the 
algorithm  were  carried  out  for  both  the  single  and  multiple 
failure  cases  with  both  hard  and  soft  faults. 

II.  The  Simulation  Model 

Although  our  test  algorithm  can  be  formulated  in  terms 
of  any  of  the  standard  system  models  for  the  purpose  of 
this  exposition  we  will  assume  a  component  connection 


model  for  the  circuit  or  system  under  test  [3).  In  the 
nonlinear  case  the  unit  under  test  is  represented  by  a  set  of 
decoupled  state  models  characterizing  its  components 
and/or  subsystems  together  with  an  algebraic  connection 
equation  as  follows: 

*,»/(*, .a,) 

b,= x,(Q )  =  0,  »  =  1,2.-  •  •  ,n  (2.1) 

and 

a  —  Lub  +  L2iu  (2  2) 

y  =  L2lb+L22u.  (2J) 

Here,  a  =  coi(a,)  is  the  column  vector  composed  of  the 
component  input  variables ,  b  —  col(b,)  is  the  column  vector 
composed  of  component  output  variables,  x,  —  col(xj)  is  the 
column  vector  composed  of  the  component  state  variables,  u 
is  the  vector  of  external  test  inputs  applied  to  the  system 
and  y  is  the  vector  of  system  responses  measured  at  the 
various  test  points.  Although  the  component  connection 
model  is  not  universal  it  is  quite  general  and  subsumes 
most  of  the  classical  topological  connection  models  com* 
monly  used  in  circuit  and  system  theory  [3].  Moreover,  its 
inherently  decoupled  nature  is  ideally  suited  to  the  test 
problem  wherein  we  desire  to  distinguish  between  the 
characteristics  of  the  individual  system  components.  Al¬ 
though  these  components  may  be  taken  to  be  elementary 
RLC  components  and/or  discrete  semiconductor  devices, 
in  practice  the  “components"  are  taken  to  be  the  “re¬ 
placeable  modules*’  within  the  circuit  or  system,  under  test; 
say,  an  IC  or  a  “throw-away”  circuit  board. 

At  each  step  of  the  test  algorithm  we  subdivide  the 
“components”  into  two  groups  denoted  by  “1”  and  “2” 
with  the  components  in  group  “1”  assumed  to  be  good  and 
used  together  with  the  known  values  of  u  and  y  to  compute 
the  component  input  and  output  variables,  a,  and  b,,  for 
the  components  in  group  “2”.  Although  computationally 
we  prefer  to  work  with  the  decoupled  component  equations 
for  notational  brevity  we  combine  the  equations  for  the 
components  in  each  group  into  a  single  equation 


i'  =  f'(x',a') 
b'  =  g'(x'.a'). 

x‘(0)  =  0 

(2.4) 

x2  =  /:(x2,a2) 
b2  =  g2(x2,a2). 

x2(0)  =  0. 

(2.5) 

Here,  x1.  a\  and  b 1  are  the  vectors  of  group  “1”  compo¬ 
nent  state  variables,  inputs,  and  outputs;  and  similarly  for 
x:.  a2,  and  b:.  To  retain  rotational  compatibility  with  (2.4) 
and  (2.3)  we  reorder  and  partition  the  connection  equa¬ 
tions  of  (22}  and  (2.3)  to  be  conformable  with  (2.4)  and 
(2.5)  as  follows: 


a1  =  O'  +  I’n^2  + 

(2.6) 

(2.7) 

y=L':,b'^Llb2-Lxu. 

(2.3) 
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Given  (2.4)-(2.8)  our  goal  is  to  compute  the  group  *‘2” 
component  variables,  a1  and  b :,  given  the  test  input,  u,  the 
measured  tes:  responses,  v,  and  aii  assumption  to  the  effect 
that  the  group  "1  ”  components  are  not  faulty.  To  this  end 
we  assume  that  L?,  admits  a  left  inverse,  [L§,J~£,  which,  in 
turn,  determines  the  allowable  component  subdivisions. 
Under  this  assumption  one  may  then  formulate  a  compo¬ 
nent  connection  model  for  a  “ pseudo  circuit  ”  composed  of 
the  group  “1”  components  with  external  input  vector  up  = 
col  ( u,  y )  and  external  output  vector  yr  =  col  ( a 2.  b2 )  in  the 
form 


test  algorithms  are  tabulated  as  follows: 


—  f\x\ax) 

b'  =  g,(x\a‘),  x'(0)  =  0 

a'  =  Kubl  +  K2lup 
y'  =  Kvb'  +  Kau'. 


^'2  —  [  ^'l2  —  ^m[^2i]  ] 


(2.12) 


(2.13) 


(2.14) 


*22  = 


-M,r 


-12  i  ^-nl^ji] 

T  r.n-t 


(2.15) 

Since,  in  our  test  problem  both  u  and  y  are  known,  the 
above  equations  can  be  solved  via  any  standard  circuit 
analysis  code  to  compute  yf  =  (a2,b2).  Now,  under  our 
assumption  that  the  group  “1”  components  are  not  faulty 
yf  -  (a2,  b2)  represents  the  inputs  and  outputs  which  actu¬ 
ally  appeared  at  the  terminals  of  the  group  M2”  compo¬ 
nents  during  the  test.  As  such,  we  may  determine  which  of 
the  group  **2”  components  are  faulty  by  solving  (2.5)  with 
input  a1  and  checking  to  determine  whether  or  not  the 
resultant  output  coincides  with  b2.  Of  course,  since  our 
assumption  to  the  effect  that  the  group  “1”  components 
are  not  faulty  may  not  be  valid  the  results  of  this  test  are  not 
reliable.  As  such,  we  repeat  the  process  a  number  of  times 
with  different  choices  for  the  subdivision- of  the  compo¬ 
nents  into  group  “1”  and  group  “2”.  Here,  the  only 
constraint  on  the  choice  of  subdivisions  is  the  requirement 
that  [Z.2|)-t  exist  while  the  number  of  combinations  em¬ 
ployed  is  limited  only  by  the  cost  of  the  required  simu¬ 
lations.  The  results  of  the  several  steps  in  the  test  algorithm 
are  then  analyzed  via  the  techniques  described  in  the 
following  section  to  determine  those  components  which  are 
actually  faulty.  To  this  end  the  results  of  each  step  of  the 


“1”  a  b  c  •• 

■  k 

0 

X 

1 

y 

0 

2 

(2.9) 

(2.10) 

(2.11) 


Indeed,  some  algebraic  manipulation  of  (2.6)-(2.8)  to¬ 
gether  with  the  assumption  that  [Ll,]"*-  exists  will  yield 


Here  a,b,c,-  ■  -,k  denote  the  group  “1”  components  for  a 
given  step  of  the  test  algorithm,  x,y,---,z  denote  the 
corresponding  group  “2”  components  while  the  binary 
annotation  associated  with  the  group  “2”  components 
indicates  whether  this  step  of  the  test  algorithm  indicated 
that  they  were  good  (0)  or  bad  (1).  Although  this  tabular 
notation  is  somewhat  cumbersome  we  will  eventually  gen¬ 
erate  a  binary  array  indexed  by  the  group  “l”  and  group 
”2”  components  in  the  process  of  our  decision  algorithm  in 
which  case  the  tabular  notation  proves  to  be  convenient 
For  linear  systems  one  may  formulate  an  identical  algo¬ 
rithm  in  which  the  component  equations  (2.1)  are  modeled 
in  the  frequency  domain  via 

5,  =  Z,a, ,  i  =  1,2,*  •  •  ,n  (2.16) 

where  we  have  suppressed  the  r-variable  for  notational 
brevity.  Then  upon  subdividing  the  components  into  two 
groups  characterized  by  the  equalities  b'  =  Zal  and  b'  =  Za2 
and  solving  the  resultant  equations  under  the  assumption 
that  exists  one  obtains  an  equation  in  the  form 

yr  —  Muf.  Specifically, 

a2  =  Mtlu  +  Mi2y  (2.17) 

b2  ~  M2tu  +  M^y  (2.18) 

where 

•Z'[iiI-L\l1[t!,]‘iA22])+[^2-^[^1]‘^22 

(2-19) 

A^2=(Ur,-^[Ly-^,] 

•  {i-z,[iyt-Liu^r,4,]r 

+  (2.20) 

^2,  =  -  ( [ Lh]  " LlAi { 1  -  Z' 1  [ L\\  -  L\2  [ Ll]  -  LL\, }  " ' ' 
■Z'{l\,~  L\2 [L|,]“  LL22] )  -  [ L'2]  ’  LLu  (2.21 ) 
**22  =  -  ([ Mi]  ‘  *4.  { 1  -  2' [ L\\  -  L',*  [L?,]  -  lL\ , }  " ' 

+  (2.22) 


Although  these  expressions  appear  to  be  foreboding  they 
nay  all  be  computed  with  the  aid  of  only  a  single  matrix 
inversion.  Moreover,  since  the  Mt)  are  independent  of  the 
test  data  and  computed  in  terms  of  the  nominal  values  of 
me  group  “1”  components  they  may  be  computed  off-line 
and  stored  in  a  data  base  to  be  retreived  at  the  time  a  test 
it  conducted.  Furthermore,  since  only  a  single  test  vector  is 
-  required,  single  frequency  testing  can  be  employed  in  which 
case  the  M , ,  need  only  be  computed  at  a  single  frequency. 
As  such,  the  only  on-line  computation  required  for  the 
fault  diagnosis  of  a  linear  system  is  the  matrix- vector 
multiplication  indicated  by  (2.17)  and  (2.18)  together  with 
the  computation  of  Z2a2. 

Unlike  the  linear  case,  if  one  is  working  with  a  nonlinear 
circuit  or  system,  the  simulations  required  to  compute  a2 

-  and  b2  require  a  priori  knowledge  of  y  and  u  and  thus  must 
be  carried  out  on-line.  In  practice,  however,  relatively  few 
time  steps  are  required  by  these  simulations,  thereby  mini- 

-  mizing  their  running  time.  Moreover,  all  simulations  are 
-.carried  out  with  nominal  components  allowing  one  to  use 

standard  computer-aided  design  circuit  models.- Indeed, 
since  the  group  “2”  component  models  are  only  invoked  at 
the  final  step  of  the  analysis  one  can  avoid  simulating 
‘'troublesome”  components  by  always  including  them  in 
group  ”2”  though  this  usually  means  that  additional  test 
points  will  be  required.  As  such,  one  can  avoid  simulating 
“fuzzy”  components  which  do  not  admit  a  viable  simu¬ 
lation  model  and/or  nonlinear  components.  Indeed,  if 
sufficiently  many  test  points  are  available  to  permit  all 
nonlinear  components  to  be  included  in  group  “2”  a  linear 
'simulation  model  such  as  that  of  (2.17)  and  (2.18)  may  be 
employed  even  for  a  nonlinear  system. 

III.  Decision  Algorithms 

Since  the  results  of  the  tests  described  in  the  preceding 
section  are  dependent  on  our  assumption  that  the  group 
“l”  components  are  not  faulty  they  are  not  immediately 
applicable.  Following  the  philosophy  initiated  by  Pre- 
parata.  Metze,  and  Chein  [9]  in  their  study  of  selftesting 
computer  networks,  however,  if  one  assumes  a  bound  on 
the  maximum  number  of  faulty  components  it  is  possible 
to  determine  the  actual  faultfs)  from  an  analysis  of  the  test 
results  obtained  at  the  various  steps  in  the  algorithm.  To 
this  end  we  will  give  a  complete  analysis  of  the  theory 
required  to  locate  a  single  fault  together  with  an  heuristic 
which  is  applicable  to  the  multiple  fault  case. 

.  Let  us  assume  that  at  most  one  circuit  component  is 
faulty  and  that  the  test  results  obtained  from  a  given  step 
of  the  algorithm  indicate  that  all  group  “2”  components 
are  good  as  indicated  in  the  following  table: 
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a  b  c  •  •  •  k 
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i  i 
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■ 

In  this  case  we  claim  that  the  group  “2”  components  are. 


in  fact.  good.  Indeed,  if  a  group  two  component  were 
actually  faulty  then  our  test  results  are  incorrect,  which 
could  only  happen  if  one  of  the  group  “1”  components  was 
faulty.  As  such,  the  system  would  have  two  faulty  compo¬ 
nents  contradicting  our  assumption  to  the  effect  that  at 
most  one  component  is  faulty. 

Now.  consider  the  case  where  the  results  from  a  given 
step  of  the  test  algorithm  indicate  that  exactly  one  group 
“2”  component  is  faulty:  say,  x 


In  this  case  the  same  argument  we  used  above  will  guaran¬ 
tee  that  the  components  which  test  good:  say,  y  through  r, 
are,  in  fact  good.  On  the  other  hand  we  have  no  informa¬ 
tion  about  x.  It  may  be  faulty  or,  alternatively,  the  test 
result  may  be  due  to  a  faulty  group  “1”  component 
Finally,  consider  the  case  where  two  or  more  group  “2” 
components  test  bad  in  a  given  step  as  indicated  in  the 
following  table: 


Since,  under  our  assumption  of  a  single  failure,  it  is  impos¬ 
sible  for  two  or  more  group  “2”  components  to  be  faulty, 
this  test  result  implies  that  at  least  one  of  the  group  “1” 
components  is  bad.  On  the  other  hand,  since  we  have 
assumed  that  there  is  at  most  one  faulty  component  the 
faulty  group  “l"  component  is  the  only  faulty  component 
and,  as  such,  the  group  “2”  components  arc  all  good. 

Consistent  with  the  above,  at  each  step  of  the  test 
algorithm,  either  all  or  all  but  one  of  the  group  “1” 
components  are  found  to  be  good.  As  such,  if  we  choose 
our  subdivisions  so  that  the  components  which  are  found 
to  be  good  at  one  step  of  the  algorithm  are  included  in 
group  “1"  in  all  succeeding  steps  we  will  eventually  arrive 
at  a  group  “1”,  all  of  whose  components  are  known  to  be 
good.  As  such,  the  test  results  obtained  at  that  step  will  be 
reliable,  thereby  allowing  us  to  accurately  determine  the 
faulty  components  in  group  “2”.  Although  the  number  of 
components  in  group  “1”  and  group  “2”  may  vary  from 
step  to  step  (especially  if  we  work  with  multivariate  compo¬ 
nents)  if  we  assume  that  group  ”1”  contains  n-m  compo¬ 
nents  and  group  “2”  contains  “m”  components  at  each 
step  of  the  algorithm  then  the  process  will  terminate  in 
approximately  n /rr.  steps.  Since  the  computational  cost  of 
the  algorithm  is  proportional  to  the  number  of  steps  (essen- 
\ially  the  cost  of  one  simulation  per  step)  while  m  is 
determined  by  the  number  of  allowable  test  points  the 
ratio  n/m  represents  a  natural  measure  of  the  possible 
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tradeoffs  between  test  point  and  computer  requirements 
when  employing  the  algorithm  in  a  single  fault  made. 

Unlike  the  single  fault  case,  at  the  time  of  this  writing, 
we  do  not  yet  have  an  exact  decision  algorithm  for  the 
multiple  fault  case.  Following  Liu.  however,  the  problem 
can  be  greatly  simplified  if  one  adopts  an  “analog  heuris¬ 
tic"  to  the  effect  that  two  independent  analog  failures  will 
neNer  cancel  [6].  Needless  to  say,  this  is  an  inherently 
analog  heuristic  since  two  binary  failures  have  a  fifty-fifty 
chance  of  cancelling  one  another.  In  the  analog  case, 
however,  two  independent  failures  are  highly  unlikely  to 
cancel  one  and  another  (as  long  as  one  works  with  reasona¬ 
bly  small  tolerances).  Indeed,  in  the  totality  of  some  72 
simulations  of  the  algorithm  using  the  heuristic  which  we 
have  run  on  three  different  circuits  it  has  never  failed. 

-  Recall  from  our  discussion  of  the  single  fault  case  that 
whenever  a  test  result  indicates  that  a  component  is  good 
then  it  is.  in  fact.  good.  Although  this  is  not  rigorously  true 
“in  the  multiple  failure  case  it  is  true  under  the  assumption 
of  our  heuristic.  For  instance,  consider  the  test  results 
indicated  in  the  following  table  in  which  x  is  found  to  be 
good. 


Now,  if  x  is  actually  faulty  there  must  be  a  fatuity  group 
“1"  component  whose  effect  is  to  cancel  the  error  in  x  as 
observed  during  this  step  of  the  test  algorithm.  This  is, 
however,  forbidden  by  our  heuristic  and,  as  such,  we 
conclude  that  x  is  actually  good. 

Interestingly,  our  heuristic  can  be  carried  a  step  further 
than  indicated  above  since,  under  our  heuristic,  a  bad 
group  "1”  component  would  normally  yield  erroneous  test 
results.  An  exception  would,  however,  occur  if  some  of  the 
group  “1”  components  are  totally  decoupled  from  some  of 
the  group  “2”  components.  As  such,  if  prior  to  our  test  we 
generate  a  coupling  table  (by  simulation  or  a  sensitivity 
analysis)  which  indicates  whether  or  not  a  faulty  group  “1" 
component  will  effect  the  test  results  on  a  group  “2” 
component,  our  heuristic  may  be  used  to  verify  that  certain 
group  “1”  components  are  good  whenever  a  good  group 
“2"  component  is  located.  Consider  for  example  the  fol¬ 
lowing  table: 


in  which  a  “I”  in  the  /-  j  position  indicates  that  the  test 
results  for  component  i  are  affected  by  component  j  while 
a  “0”  in  the  /- j  position  indicates  that  component  j  does 


not  affect  the  test  results  for  component  t.  Now,  since 
component  x  has  been  found  to  be  good  in  this  test  our 
heunstic  implies  that  those  group  “1”  components  which 
are  coupled  to  x  in  this  test  are  also  good.  Similarly,  since  : 
is  good  the  heuristic  implies  that  b  and  c  are  also  good. 
Thus  with  a  single  test,  we  have  verified  that  or,  r,  a.  b,  c, 
and  k  are  all  good. 

Since  in  any  practical  circuit  the  coupling  table  is  com¬ 
posed  mostly  of  l’s  it  has  been  our  experience  that  rela¬ 
tively  few  steps  of  the  algorithm  will  yield  a  complete 
diagnosis.  To  implement  the  heuristic,  however,  one  must 
assume  that  the  maximum  number  of  faulty  components  is 
strictly  less  than  the  number  of  group  two  components.  If 
not,  the  test  results  at  each  step  may  show  that  all  group 
“2”  components  are  faulty,  in  which  case  no  reliable  test 
information  is  obtained.  Moreover,  the  degree  to  which  the 
number  of  group  “2”  components  exceeds  the  maximum 
number  of  faulty  components  determines  the  number  of 
algorithm  steps  which  will  be  required  to  fully  diagnose  a 
circuit. 

IV.  Examples 

To  illustrate  the  exact  decision  algorithm  for  the  single 
fault  case  consider  a  system  composed  of  eight  compo¬ 
nents:  a,  b,-  ■  ♦  .A;  in  which  any  five  may  test  the  remaining 
three.  Initially,  we  let  a  through  e  represent  the  group  “1” 
components  .and  /,  g,  and  h  represent  the  group  “2” 
components  and  assume  that  the  test  results  for  this  first 
step  are  as  indicated  in  the  following  table: 


Employing  our  exact  algorithm  for  the  single  fault  case  the 
above  table  indicates  that  components  /  and  g  are  good 
and,  as  such,  we  move  them  into  group  “1”  for  the  second 
step  of  the  algorithm  obtaining 


Since  this  test  indicates  that  two  group  “2”  components 
are  bad  which  condradicts  our  single  fault  assumption  the 
faulty  component  must  be  in  group  “1”  implying  that  d,  e, 
and  h  are  all  good.  We  therefore  move  these  components 
into  group  “1"  and  implement  the  final  step  of  our  algo¬ 
rithm  in  the  form 
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Since  all  group  “1"  components  are  known  to  be  good  this 
final  test  is  reliable  and  indicative  of  the  fact  that  b  is  the 
faulty  component 

Note,  that  the  requirement  that  Lf,  be  left  invertible 
may  make  it  impossible  to  use  some  component  subdivi¬ 
sions  in  which  case  an  alternative  sequence  of  steps  may  be 
required  in  the  above  process.  For  instance,  if  h,  e,  d,  /, 
and  g  is  not  an  allowable  subdivision  the  last  step  in  the 
above  process  might  be  replaced  by 

- - -  I  e  d  f  g  a 


indicating  that  e  and  h  are  good.  Now,  a  final  step  in  which 
c.  e,  d,  /,  and  g  make  up  group  “1”  will  be  reliable  as 
indicated  below: 


Now,  consider  the  same  single  fault  example  in  which  our 
heuristic  algorithm  is  applied  using  the  coupling  table 
indicated  below: 


Fi|.  2.  Nonlinear  scum  on  which  the  propoaad  algorithm  was  Mated 
»fw|  sinulaied  test  diu. 

which  gives  us  no  information  in  the  multiple  failure  case.1 
As  such,  we  try  another  allowable  combination  obtaining 
the  following  table: 


“2 — 

*4|tt 

|  f 

g 

a 

d 

e 

1 

b 

0 

1 

T 

1 

0 

4) 

h 

1 

1 

i 

0 

1 

1 

C 

1  o 

1 

i 

1 

0 

— - _ T 

a 

b 

c 

d 

e 

0 

/ 

1 

0 

T 

I 

0 

0 

g 

0 

0 

i 

1 

0 

1 

A 

1 

1 

i 

0 

1 

According  to  our  heuristic/and  g  are  good  and,  moreover, 
everything  in  group  “1”  which  is  coupled  to  either  for  g  is 
good.  As  such,  we  conclude  from  this  first  step  that  /,  g,  a, 
c,  d,  and  e  are  all  good.  Thus  taking  group  “1”  to  be  r,  d,f, 
g,  and  a  in  the  next  step  will  yield  a  reliable  test  for  b,  d, 
and  h  as  above. 

Finally,  consider  the  case  where  at  most  two  failures  are 
assumed  with  the  first  step  in  our  test  algorithm  yielding: 
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Consistent  with  our  heuristic  /,  a ,  d,  and  e  are  found  to  be 
good  in  this  step.  Incorporating  these  components  into 
group  “1"  for  the  following  step  we  obtain: 
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indicating  that  A,/,  g,  a.  and  e  are  good.  Coupled  with  our 
previous  knowledge  that  d  is  good  this  implies  that  all 
group  “1"  components  are  good  and  hence  this  last  step  in 
our  algorithm  reliably  indicates  that  b  and  c  are  the  faulty 
components. 

To  obtain  an  estimate  of  the  actual  performance  of  the 
algorithm  it  was  applied  to  8  different  cucuits  using  simu¬ 
lated  test  data  [17].  These  included  the  3,  6,  and  14 
component  nonlinear  circuits  of  Fig.  2(a)  -(c),  for  which 
the  full  rime-domain  implementation  of  the  algorithm  was 
carried  out  and  the  linear  12,  22,  30,  100,  and  200  compo¬ 
nent  circuits  of  Fig.  3(aHc>,  on  which  a  frequency-domain 
simulation  was  carried  out  In  addition  multiple  failure 
examples  were  run  on  the  12  and  22  component  circuits 
and  perturbations  on  the  good  component  values  were 
introduced  into  one  run  of  the  12  component  circuit  to  test 
its  robustness.  For  the  3  and  6  component  nonlinear  cir¬ 
cuits  and  the  12  component  linear  circuit  the  algorithm  was 
implemented  on  a  desk  top  calculator  while  a  TI  990/20 
16-bit  mini  and  a  VAX  11/780  32-bit  midi  were  used  for 
the  larger  circuits.  The  results  of  a  total  of  443  simulations 
of  the  algorithm  are  summarized  in  Table  I.  In  all  443  runs 
the  algorithm  failed  to  give  the  correct  answer  on  only  four 
occasions.  Indeed,  this  problem  was  encountered  only  with 

'Actually  one  can  deduce  that  ai  least  one  of  (be  (roup  “I"  components 
is  bad  since  ail  three  (roup  “2"  components  cannot  be  bad  via  our  two 
fault  assumption.  This,  in  turn,  implies  that  at  most  one  of  the  (roup  “2" 
components  is  bad. 
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Fig.  3.  Linear  circuits  on  which  the  proposed  algorithm  was  tested  using 
simulated  test  data. 


V.  Conclusions 

Although  the  proposed  algorithm  is  still  new  it  appears 
to  meet  most  of  the  criteria  formulated  by  Plice  [8]  as  well 
as  those  of  [1 1).  Although  the  on-line  computational  require¬ 
ments  for  the  algorithm  do  not  compare  with  a  simulation- 
before-test  algorithm  they  can  be  kept  within  reasonal 
bounds.  Indeed,  unlike  most  simulation-after-test  algo¬ 
rithms  no  iterative  on-line  computation  is  required.  More¬ 
over,  one  can  limit  the  on-line  computation  by  restricting 
the  number  of  algorithm  steps  (at  the  price  of  increasing 
the  ambiguity  in  the  resultant  diagnosis).  Furthermore,  in 
the  linear  case  and/or  in  the  cut  where  there  are  suffi¬ 
ciently  many  test  points  available  to  permit  all  nonlinear 
and  fuzzy  components  to  be  included  in  group  “2”  the 
major  part  of  the  computation  required  by  the  algorithm 
can  be  done  off-line. 

In  general  the  proposed  algorithm  permits  one  to  trade¬ 
off  between  on-line  computational  requirements  and  test 
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the  50  component  circuit  of  Fig.  3(c)  when  run  on  the 
16-bit  mini  computer,  wherein  we  believe  that  it  was  caused 
by  the  numerical  error  resulting  from  the  inversion  of  a 
50-by-50  matrix  without  the  aid  of  sparse. matrix  tech¬ 
niques  (17).  Indeed,  the  same  example  run  on  the  VAX 
yielded  an  accurate  diagnosis  with  any  of  the  50  compo¬ 
nents  faulty.  The  size  of  the  ambiguity  set  resulting  from 
each  simulation  of  the  algorithm  is  indicated  in  Table  I  as 
is  the  mean  number  of  on-line  simulations  required  for  the 
various  circuits.  In  addition  one  off-line  simulation  per 
component  subdivision  is  required  to  generate  the  coupling 
tables  required  by  the  heuristic  decision  algorithm. 


points.  Indeed,  as  indicated  in  Table  I,  one  can  reduce  the 
test  point  requirements  to  quite  reasonable  levels  though 
this  is  usually  achieved  at  the  cost  of  increasing  the  number 
of  steps  in  the  algorithm  (and  hence  its  on-line  computa¬ 
tional  requirements).  In  particular,  our  simulations  indicate 
that  the  algorithm  comes  dose  to  achieving  the  /it  test 
point  goal  set  in  [1 1). 

With  respect  to  the  remainder  of  the  criteria  specified  in 
[11]  the  algorithm  “looks  good.”  In  particular,  all  simula¬ 
tions  are  carried  out  using  nominal  component  models,  it 
can  test  Unear  and  nonlinear  modules  of  arbitrary  size,  and 
is  amenable  to  in  situ  testing  and  parallel  processing  tech¬ 
niques  (since  several  steps  of  the  algorithm  can  be  carried 
out  simultaneously). 

At  the  present  time  the  major  open  question  with  respect 
to  the  performance  of  the  algorithm  is  its  robustness.  In¬ 
deed,  there  is  nothing  in  the  algorithm  to  make  it  inher¬ 
ently  robust,  though  our  initial  test  for  robustness  indi¬ 
cated  by  the  asterisk  in  Table  I  proved  to  be  favorable. 
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Feedback  System  Design: 


The  Single-Variate  Case  —  Part  I* 

R.  Sacks',  J.  Murray >,  O.  Chua1,  C.  Karmokolias \ 
and  A.  Iyer 1 


Aheinct.  A  recently  developed  al|ebnic  approach  to  the  feedback  system  design 
problem  is  reviewed  vie  the  derivation  of  the  theory  in  the  single-variate  case.  This 
allows  the  simple  algebraic  nature  of  the  theory  to  be  brought  to  the  fore  while 
simultaneously  minimizing  the  complexities  of  the  presentation.  Rather  than  simply 
giving  a  single  solution  to  the  prescribed  design  problem  we  endeavor  to  give  a  com¬ 
plete  parameterization  of  the  set  of  compensators  which  meet  specifications. 
Although  this  might  at  first  seem  to  com  plicate  our  theory  it,  in  fact,  opens  the  way 
for  a  sequential  approach  to  the  design /problem  in  which  one  parameterizes  the 
subset  of  those  compensators  which  meet  the  aeooad  specification.  ..etc.  Specific 
problems  investigated  include  feedback  system  stahfBiation,  the  tracking  and  distur¬ 
bance  rejection  problem,  robust  design,  transfer  function  design,  pole  placement, 
simultaneous  stabilization,  and  stable  stabilization. 


In  1976  Youla,  Bongiorno,  and  Jabr  published  two,  now  classical,  papers 
[23,24]  in  which  a  complete  parameterization  of  the  set  of  stabilizing  com¬ 
pensators  for  a  multivariate  feedback  system  was  obtained.  In  the  ensuing 
years  this  work,  which  is  often  termed  the  YBJ  theory,  has  led  to  the 
development  of  an  entirely  new  approach  to  the  feedback  system  design 
problem.  Indeed,  their  stabilization  theory  has  been  extended  to  include: 

(i)  the  tracking  and  disturbance  rejection  problem 

(ii)  robust  design  algorithms 

(in)  design  with  a  proper  or  stable  compensator 
(iv)  transfer  function  design 
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(v)  pole  placement 

(vi)  simultaneous  stabilization 

Moreover,  much  of  the  work  has  been  extended  to  the  case  of  general  linear 
systems;  distributed,  rime- varying,  multidimensional,  etc.;  by  formulating 
it  in  an  abstract  ring  tbeortic  [8,10]  or  aigebro-geometric  setting  (17).  Un¬ 
fortunately,  these  generalizations  have  been  achieved  at  the  cost  of  increas¬ 
ing  the  complexity  of  the  theory  and,  as  such,  the  simple  algebraic  character 
of  the  work  has  been  obscured. 

The  purpose  of  the  present  paper  is  to  survey  this  literature  in  such  a  way 
as  to  illustrate  the  simplicity  of  the  theory.  To  this  end  the  presentation  is 
restricted  to  the  single- variate  case  wherein  a  simple  algebraic  theory  is 
possible.  Indeed,  by  so  doing  we  are  able  to  give  simple  single-variate 
algebraic  derivations  for  several  results  whose  true  character  has  hitherto 
been  obscured  by  the  abstract  ring  theoretic  or  multivariable  theory. 

The  key  to  our  theory  is  a  three  step  design  philosophy 

(i)  stabilization 

(ii)  achievement  of  design  constraints 

(iii)  optimization  of  system  performance 

Fine  and  foremost,  a  feedback  system  must  be  stable  and,  as  such,  the  first 
step  in  the  design  process  is  the  parameterization  of  all  stabilizing  compen¬ 
sators  tor  the  ghnm  plant.  Although  it  might  suffice  to  specify  a  single 
stabilising  compensator  if  our  goal  was  simply  to  design  a  stable  system,  in 
practice  stabilization  is  only  the  first  step  of  the  detign  process.  As  such,  we 
must  characterise  all  staMhziag  compensators  if  we  are  to  choose  among  the 
stabilizing  compensators  to  find  oae  which  also  achieves  the  design  con¬ 
straints  and/or  optimizes  system  performance.  A  complete  parameteriza¬ 
tion  of  the  set  of  stabilizing  compensators  for  the  given  plant  is  thus  ob¬ 
tained  as  a  first  sup  in  the  design  procam.  Indeed,  the  parameterization  is 
chosen  in  such  a  way  that  the  various  feedback  system  gains  are  linear 
(affine)  in  the  resultant  design  parameter,  thereby  setting  the  stage  for  the 
choice  of  a  design  parameter  which  also  achieves  the  design  constraints 
and/or  optimizes  some  measure  of  system  performance. 

Once  the  stabilizing  compensators  have  been  characterized,  step  two  of 
the  design  process  is  to  choose  a  subset  of  the  stabilizing  compensators  . 
which  also  achieve  the  prescribed  design  constraints;  tracking  and  distur¬ 
bance  rejection,  transfer  function  specification,  robustness,  etc.  Finally,  if 
any  remaining  design  latitude  exists  after  the  design  constraints  have  been 
met  it  may  be  used  to  optimize  some  measure  of  system  performance;  sen¬ 
sitivity,  energy  consumption,  etc. 

The  paper  is  divided  into  two  parts  dealing  with  the  classical  asymptotic 
design  problems r.  stabilization,  tracking,  and  disturbance  rejection;  and  a 
survey  of  modern  frequency  domain  design;  robust  design,  pole  placement, 
simultaneous  design,  respectively.  In  the  remainder  of  this  introduction  the 
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fractional  representation  theory  for  a  single-variate  system  is  developed. 
The  key  to  this  theory  lies  with  the  representation  of  a  rational  function  as  a 
ratio  of  stable  rational  functions  rather  than  as  a  ratio  of  polynomials.  Such 
a  formulation  opens  the  door  to  the  desired  generalization,  wherein  stability 
is  well  defined  even  though  no  analog  of  a  polynomial  exists.  Moreover  it 
yields  what  we  believe  to  be  a  more  natural  concept  of  coprimeness  in  which 
only  cancellations  between  (dosed)  right  half-plane  zeros  are  forbidden.  In¬ 
deed,  the  (strict)  left  half-plane  plays  only  a  minimal  role  in  the  theory. 

In  Section  2  a  derivation  of  the  YSJ  stabilization  theory  is  formulated  in 
terms  of  a  stable  coprime  fractional  representation.  Although  this  deriva¬ 
tion  has  appeared  before  [4,8],  even  in  the  single-variate  case,  a  complete 
proof  is  given  because  of  its  fundamental  nature  to  the  remainder  of  the 
work.  Indeed,  the  proof  technique  introduced  here  is  repeated,  in  one  form 
or  another,  throughout  the  paper. 

Sections  3-5  are  devoted  to  the  tracking  and  disturbance  rejection  pro¬ 
blems  [4].  Unlike  the  stabilization  problem  a  solution  to  these  problems 
may  fail  to  exist.  Necessary  and  sufficient  conditions  for  the  existence  of  a 
solution  are,  however,  given  in  the  form  of  appropriate  coprimeness  criteria 
and  a  complete  parameterization  of  the  required  set  ofxompensators  is  ob¬ 
tained  when  these  criteria  are  satisfied. 

Part  II  of  the  paper  begins  with  Section  6  in  which  the  problem  of  robust 
design  is  taken  up.  Unlike  the  stabilization  problem  for  which  every  solu¬ 
tion  is  robust  a  solution  to  the  tracking  and/or  disturbance  rejection  prob¬ 
lem  may  fail  to  be  robust.  Surprisingly,  however,  whenever  these  problems 
are  solvable  they  are  robustly  solvable.  As  such,  beginning  with  the  same 
coprimeness  criteria  used  in  the  non-robust  case  we  give  an  explicit 
parameterization  for  the  set  of  compensators  which  robustly  solve  the 
tracking  and  disturbance  rejection  problems.  This  result,  however,  only  ap¬ 
plies  to  our  single-variate  case.  In  the  general  multivariate  case  a  robust 
solution  may  fail  to  exist  even  though  a  non-robust  solution  exists  [11 J. 

In  Section  7  the  problem  of  designing  a  compensator  which  simul¬ 
taneously  stabilizes  a  feedback  system  and  realizes  a  prescribed  input- 
output  feedback  system  gain  is  investigated.  The  required  existence  criteria 
for  this  transfer  function  design  problem  are  formulated  in  terms  of  a 
divisability  condition  in  the  ring  of  stable  transfer  functions.  This  is  followed  in 
Section  8  by  an  investigation  of  the  pole  placement  problem  in  which  one 
desires  to  construct  a  compensator  which  will  simultaneously  stabilize  a 
system  and  place  the  pole;  of  its  input-output  gain  at  prescribed  points  in 
the  left  half -plane.  Interestingly,  the  extent  to  which  this  end  can  be  achieved  is 
determined  precisely  by  the  “degree”  to  which  the  plant  fails  to  be 
miniphase. 

In  Section  9  the  problem  of  designing  a  compensator  which  simul¬ 
taneously  stabilizes  two  distinct  plants  is  solved.  Although  this  two  plant 
problem  is  a  very  special  case  of  the  general  simultaneous  design  problem  it 
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is  the  one  example  of  the  problem  for  which  a  definitive  frequency  domain 
design  criterion  exists  [7]  and  is  thus  indicative  of  the  direction  of  future 
research  in  this  area.  Moreover,  the  problem  of  stabilizing  a  feedback 
system  with  a  stable  compensator  [25]  proves  to  be  a  special  case  of  this  two 
plant  problem  which  is  developed  in  Section  10. 

Section  11  is  devoted  to  a  short  discussion  of  the  "optimization  problem" 
associated  with  step  three  of  our  design  process.  Since  the  specific  optimize- 
don  one  might  choose  to  undertakr  is  dependent  on  the  physical  system 
under  study  and  its  application  the  development  in  this  section  concentrates 
on  the  interface  between  our  theory  and  the  optimization  process,  without 
going  into  specifics. 

Finally,  Section  12,  is  devoted  to  an  historical  overview  of  the  theory  and 
a  discussion  of  the  various  generalizations  end  extensions  which  thus  far 
have  been  formulated. 

Our  system  will  be  described  by  a  rational  function 


rls) 


pit) 

qis) 


(1.1) 


Such  a  system  is  said  to  be  stable  if  its  poles  lie  in  the  (stria)  left  half-plane. 
Since  the  point  at  infinity  is  taken  to  tie  on  the  imaginary  axis  this  implies 
that  r{s)  is  stable  if  and  only  if  it  is  a  proper  rational  function  and  qis)  is  a 
(strictly)  Hurwitz  polynomial. 

A  fractional  representation  for  r(s)  is  a  factorization  of  r(s)  in  the  form 

*»>  -4rr  a» 

dr{S) 

where  both  «i,(s)  and  d,{s)  are  stable  and  d,(s)  *  0.  If  one  is  given  a 
polynomial  fractional  representation  for  r(s)  such  as  in  equation  1.1  then 
one  can  take 
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where  m(s)  is  any  Hurwitz  polynomial  such  that  the  order  of  m(s)  equals 
the  order  of  r(s)  verifying  the  existence  of  the  required  fractional  represen¬ 
tation  [11]. 

We  say  that  the  fractional  representation  r{s)  ■  nr(s)/dris)  is  coprime 
if  there  exist  stable  rational  functions,  uris)  and  vr(s),  such  that 


uris)n,is)  +  v,is)dris )  ■  1 


(1.5) 


rmu 

i 

» 

i* 

L 


■j 


l*. 


n '» \>K  y*  l  ^  ^  r«  ^  V*  A  iv.WA  a  * 


Feedback  System  Design 

Recall  that  for  polynomials  l.S  is  equivalent  to  the  requirement  that  nr{s) 
and  dr{s)  do  not  have  any  common  zeros  [2].  In  our  case,  however,  where 
we  are  dealing  with  stable  rational  functions,  Equation  1 .3  imples  that  itr(s) 
and  dr(s)  have  no  common  (closed)  right  half-plane  zeros  and  conversely 
[4,11].  Although  this  represents  a  departure  from  classical  control  theory 
only  right  half-plane  zeros  cause  instability  and,  as  such,  it  is  appropriate 
that  only  right  half-plane  pole  zero  cancellations  be  forbidden. 

Unlike  the  classical  polynomial  fractional  representation  theory  wherein 
the  units  are  the  constant  functions  in  our  theory  the  units  are  the  miniphase 
rational  functions  which  are  stable  and  admit  a  stable  inverse.  That  is,  if 
r(s)  =  p(s)  / <7(s )  then  p(s)  and  q{s)  are  both  Hurwitz  polynomials  of  the 
same  order.  As  such,  the  classical  theorems  for  polynomial  fractional 
representations  may  be  reformulated'  in  our  setting  with  the  units  taken  to 
be  miniphase  rational  functions  as  follows. 

1.  Property.  Let  r(s)  =  nr(s)/df(s)  be  coprime  fractional  representation 
for  r{s)  and  assume  that  nr(s)  and  dr(s)  admit  a  common  divisor,  k(s). 
such  that 


dr(s)  *  yr(s)k(s) 
and 


where  yr(s),  xr(s)  and  k[s)  are  stable.  Then  k{s)  is  miniphase.  That  is,  the 
only  common  divisor  of  a  coprime  fractional  representation  is  a  unit. 

Proof.  Since  dr(s)  *  yr(s)k{s)  and  nr(s)  *  xr(s)k(s)  are  coprime  there 
exist  stable  ur(s)  and  vr(s)  such  that 

1  «  ur(s)n,(s)  +  vr(s)dr(s)  «  lur(s)xr(s)  +  vr(s)yr(s)]k(s)  (1.6) 

showing  that  [u,(s)jr,(.s)  +  u,(s)yr(s)  ]  is  a  stable  inverse  for  k(s)  and 
hence  verifying  that  k(s)  is  miniphase. 

2.  Property.  Let  r(s )  *  n(s)  /d(s)  be  a  fractional  representation  for  r(s) 
and  let  r(s)  *  xr(s)/yr(s)  be  a  coprime  fractional  representation  for  r(j). 
Then  there  exists  a  stable  k(s)  such  that 


and 


nr(s)  *  xf(s)k(s) 


dr(s)  »  y,(s)*(r) 

Proof.  Given  the  two  fractional  representations  let  k(s)  =  dr(s)/yr{s). 
Then  clearly  dr(s)  =  yr(s)k(s)  while 

n(s)  *  dr(s)  -  r(s)d(s)  -  dr(s)  *  xr(s)k(s)  (1.7) 
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showing  that  k(s)  is  a  common  factor  of  n.(s)  and  dr(s) .  It  thus  remains  to 
verify  that  k{s)  is  stable.  To  this  end  recall  that  since xr(s);yr(s)  is  coprime 
there  exists  stable  ur(s)  and  v,(s)  such  that 


hence 


[u,(j)jr,U)  +  vr(s )y,ls)  J  ■  I 


k{s)  *  *  lu'ls)x'is)  * 


ur(s)xr(s)dr{s) 


+  v,(s)dr(s)  »  uf(s)r(s)dr(s)  +  o,(s)dr(s)  (1.9) 


ur(s)nr(s)dr(s) 


+  vt(s)dr{s)  ■  K,(i)»,(S)  +  v f(s)d,(s) 


showing  that  k{s)  is  suble  since  we  have  expressed  it  as  a  sum  of  products 
of  suble  rational  functions. 

Note  that  since  a  coprime  fractional  represenution  always  exists  [11]  for 
r(s)  Property  2  implies  that  any  pair  of  suble  rational  functions,  xr{s)  and 
yr(s),  can  be  expressed  in  the  form  x,(s)  «  nr(s)k(j)  and  yr(s)  » 
dr(s)k(s)  where  nr(s)  and  d,(s)  are  coprime  suble  rational  functions  and 
k(s)  is  stable.  As  such,  k(s)  represents  a  greatest  common  divisor  for 
xr(s)  and  y,(s)  which  is  unique  up  to  a  miniphase  factor  via  Property  1. 

3.  Property.  Let  r(s)  *  nr(s)/dr(s )  be  a  coprime  fractional  representa¬ 
tion  for  r(s) .  Then  r(s)  is  stable  if  and  only  if  dr  (s)  is  miniphase. 

Proof.  If  dr(s)  is  miniphase  then  l/dr(s)  is  suble  and  hence  r(s),  being 
the  product  of  two  suble  functions  is  suble.  Conversely,  If  r(s)  is  suble 
then  we  may  express  /i,(s)  and  dr(s)  in  the  form 

n/s)  •  r(s)dr(s)  (1.10) 


d,(s)  -  1  dr(s)  (1.11) 

showing  that  dr(s)  is  a  common  faaor  of  the  coprime  rational  functions 
nr(s)  and  d,(s).  As  such  Property  1  implies  that  dt(s)  is  miniphase  as  was 
to  be  shown. 

4.  Example.  Consider  the  rational  function 


'  (*+ )) ' 

(s+l)  .  (s+ 1)2,  nr(s) 

f  (■t — »)  1 

(S  +  2) 


.  -  .  -  -■*  -  ■ 


For  this  system  the  usual  algebraic  manipulations  [8]  will  yield  th tfeed 
back  system  gains 
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where 
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Of  course,  the  system  is  said  to  be  stable  if  each  of  the  eight  feedback  system 
gains  of  equations  2.1  through  2.4  is  stable.  Since  the  input/output  gains, 
h^,  are  expressed  in  terms  of  the  input/plant 'input  gains,  Atjl ,  via  equation 
2.4  this  win  be  this  case  if  and  only  if  the  input/plant -input  gains  are  all 
stable. 

For  our  stabilization  theory  we  assume  that  a  coprime  fractional 
representation  for  the  plant  is  given  in  the  form 


where  np(s)  and  dp(s)  are  stable,  dp(s)  is  not  identically  zero,  and  there 
exists  stable  up(s)  and  vp(s)  such  that 

up(s)np(s)  +  vp(s)dp(s)  «  1  (2.6) 


In  our  single-variate  setting  every  plants  admits  such  a  representation  and 
hence  we  may  assume  2.5  and  2.6  without  loss  of  generality.  Our  goal  is  to 
characterize  the  set  of  compensators,  represented  in  the  form 


where  nc (s)  and  de(s)  are  stable  and  de(s)  is  not  identically  zero,  which 
stabilize  the  feedback  system.  Of  course,  we  would  also  like  2.7  to  be  a  co¬ 
prime  fractional  representation.  Indeed,  so  as  to  prevent  (right  half-plane) 
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pole-zero  cancellation  between;?(s)  and  c(s)  we  require  the  stronger  condi¬ 
tion  that 


pU)c(f) 


«p(s)ne(s) 

dp(s)de(s) 


(2.8) 


be  a  coprime  fractional  representation. 

Substituting  the  fractional  representations  p(s)  =  np(s)/dp(s)  and 
c(s)  a*  nc(s)/dc(s)  into  2.2  and  2.4  now  yields 


\h  ( s )  h  (s)' 

dp(s)dc(s) 

- np(s)dc(s ) 

dp(s)dc(s)  +np(s)nc(s) 

dp(s)dc(s)  +np{s)nc(s) 

h  (j)  h,  (J) 

dp(s)ne(s ) 

dp(s)dc(s) 

*2*1 

dp(s)dc(s)  +np(s)ne(s) 

dp(s)de(s)  +np(s)nc(s)  J 

and 


dp(s)ne(s) 

-np(s)nc(s) 

W  ’it 

s 

dp(s)dc(s)  +np(s)nc(s) 

dp(s)dc(s)  +npt s)nc(s) 

h  (5)  h_ ..  (s) 

np(s)ne(s ) 

r ip(s)dc(s ) 

•Vi  *yi 

dp(s)dc(s)  +np(s)ne(s) 

dp(s)de(s)  +»p(s)nc(s) 

(2.9) 


0.10) 


Since  the  fractional  representation  for  p(s)c(s)  given  in  2.8  is  coprime 
there  exist  stable  p(s)  and  q(s)  such  that 

P(s)ldp(s)de(s))  +  q(s)[np(s)nt(s))  «  1  (2.11) 

hence 

[/>(*) -?(*)]  [dp(j)df(s) ]  +  q(s)[dp(s)de(s)  -¥np(s)nt(s)\  -  1  (2.12) 

showing  that  the  fractional  representation  for  h,^  ( s )  given  in  2.9  is  co¬ 
prime.  As  such,  it  follows  from  Property  1  that  (s)  is  stable  if  and  only 
if  the  common  denominator,  [d,(s)df(s)  +np{s)ne{s)  ]  is  miniphase. 
Moreover,  since  (s)  must  be  stable  for  the  feedback  system  to  be  stable 
it  follows  that  [dp(s)de(s)  +  rtp(s)nc(s)  ]  must  be  miniphase  for  the  system 
to  be  stable.  Conversely,  if  this  common  denominator  is  miniphase  the 
system  is  clearly  stable  via  2.9  and  2.10.  Moreover,  if  the  common 
denominator  is  miniphase 

[d(s)dc(s)  +  np(s)ne(s) )  *>  ldp(s)de (s) ) 

(2.13) 

+  ldp(s)de{f)+np(s)/tc(s))-Hnp(s)ne(s))  +  1 

showing  that  the  corresponding  fractional  representation  for  p(s)c(s)  is 
coprime.  We  have  therefore  proven  the  following. 
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5.  Property.  Let  p(s)  »  np(s)/dp(s)  be  a  coprime  fractional 
representation  for  p(s)  and  let  c(s)  *  ne(s)/dc{s)  be  a  fractional 
representation  for  c(s).  Then  the  feedback  system  is  stable  and  p(s)c(s) 
»  lnp(s)ne(s))/[dp{s)dc{s)  ]  is  coprime  if  and  only  if 
[dp{s)de(s)  +  np(s)/te(s)  j  is  miniphase. 

Consistent  with  Property  5  the  goal  of  the  feedback  system  stabilization 
problem  is  to  characterize  the  compensators,  c(s)  *  ne(s)  / de(s ) ,  such  that 
fdp(s)de(s)  +  np(s)ne(s)  3  is  miniphase  given  the  coprime  fractional 
representation 


where 


(2.14) 


upis)np(s)  +  vPis*dp  *  1  (2.15) 

for  some  stable  up(s)  and  vp{s). 

Stabilization  Theorem:  For  the  feedback  system  of  Figure  1  let  the 
plant  have  a  coprime  fractional  representation  as  per  equation  2.14  and 
2.15.  Then  for  any  stable  w(s)  such  that  »(s)np(j)  +  vp(s)  is  not  iden¬ 
tically  zero  the  compensator 


( -  w(s)dls)  +  u  (s)  1  n  (*) 

C(l)  ■  ■  ■  "  m  ■  ...  — 

l*»(j)ii^(Jf)  +»p(j)l  de(s) 

stabilizes  the  feedback  system  and  yields  a  coprime  fractional  representa¬ 
tion  on  p(s)e(s)  -  [^(f)/rf(J)  ]/[<^(J)dc(s)  ]■  Conversely,  every 
such  stabilizing  compensator  is  of  this  form  for  some  stable  w(s ) . 

Proof.  According  to  Property  5  it  suffices  to  characterize  the  class  of 
stable  ne(s)  and  dc(s)  such  that 

dp(s)de(s)  +  np{s)ne(s)  *  k{s)  (2.14) 

where  k(s)  is  an  arbitrary  miniphase  function.  To  this  end  we  will  attempt 
to  compute  all  possible  stable  solutions  to  equation  2.16.  Multiplying  equa¬ 
tion  2.15  through  by  k(s)  yields 


[k(s)upls)]np(s)  +  [k(t)vp(s)]dp(s)  -  k(s)  (2.17) 

verifying  that 


and 


*?(x)  ■  k(s)up(s) 


(2.15) 


d*(s)  -  k(s)vp(r) 

are  panicular  solutions  to  equation  2.16.  On  the  other  hand 


(2.15) 
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dp{s)[np(s)r(s)\  +  np(s)  [  -  dp(s)r{s) )  *  0  (2.20) 

for  all  stable  r(s)  showing  that 

n*(s)  =  -dp(s)r(s)  <2-21) 

and 

d*(j)  =  np(s)r(s)  (2.22) 

are  homogeneous  solutions  to  2.16  for  ail  stable  r(s).  It  remains  to  show 
that  2.21  and  2.22  represent  all  homogeneous  solutions.  To  this  end  let 
**($)  and  <£(s)  represent  arbitrary  stable  homogeneous  solutions  to  2.16. 
That  is 

dp(s)d*(s)  +  ftp(s)n*(s)  -  0  (2.23) 

in  which  case  we  will  show  that  they  take  the  form  of  2.21  and  2.22.  As  a  can¬ 
didate  for  r(s)  let  us  take  r(s )  ■  -  n*(s)  /  dp(s)  in  which  case  we  have 

«*(s)  »  ~dp(s)r(s)  (2.24) 

verifying  2.21  and 

-«_(s)«*(s) 

d*U)  -  - -  «  np{s)r{s)  (2.25) 

verifying  2.22.  It  thus  remains  to  show  that  r(s)  «  -  n£(s)  / dp{s )  is  stable 
for  which  we  have 

rti(s)  ails) 

r{s)  =  -  — -  -  t up(*)np(s)+vp(s)dp(s)] 


n£{s)np(s)up(s) 

dp(s) 


Hi(s)vp[s) 


dp(s)d*{s)up(s) 

V?) 


n*{s)vp(s) 


*  d*{s)up(s)  -  n*U)vp{s)  .  (2.26) 

showing  that  r(s)  is  stable  since  it  is  expressed  as  the  sum  of  products  of 
stable  rational  functions.  As  such,  the  entire  solution  space  for  equation 
2.16  takes  the  form 

nc(s)  *  «*(s) +n£(s)  *  -r(s)dp(s)  4-k{s)up(s)  (2.27) 

and 

dc(s)  *dj(s)+df(s)  *  r(s)np(s)  +k{s)vp{s)  (2.2S) 
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where  k(s)  is  an  arbitrary  mini  phase  function  and  r(s)  is  an  arbitrary 
stable  function. 

Now,  assuming  that  r(s)  and  k{s)  are  chosen  so  that  tleis)  is  not  iden¬ 
tically  zero  we  obtain  the  desired  set  of  compensators  in  ti  e  form 


els) 


[ -r(s)rf#(s)  +  k(s)up(s)  ] 
['(*)*,(*) +*(f)v,(s)] 


[[-r(s)/k(s)]d As)  +■*.<*)]  [-W(i>d As)+u _(s)  ] 

■  —  - .  r  — i C. -  a  . —  *i  (U9) 

[  fr(r)/*U)  )np(s)  ♦  upls)  J  [»(s)np(s)  +  up] 

where  w(s)  m  r(s)  /k(s)  spans  the  set  of  stable  rational  functions  such  that 
[w(s)n  (s)  +  vpls)\  is  not  identically  zero. 

In  addition  to  giving  a  complete  parameterization  of  the  stabilizing  com¬ 
pensators  if  one  views  w(s)  rathe  than  els)  as  the  underlying  design  para¬ 
meter  for  our  feedback  system  the  expressions  for  the  various  feedback 
system  gains  are  greatly  simplified.  This  follows  by  observing  that  the  com¬ 
pensator  of  the  theorem  yields  the  common  denominator 

1  dpis)dtls)  ♦  *pts)*eis»  -  1  OM) 

(since  we  have  divided  k(s)  out  of  the  expression  for  els) ).  As  such,  the 
denominators  in  Equations  2.9  and  2.10  drop  out  yielding  the  following  ex¬ 
pressions  for  the  feedback  system  gains  which  are  linear  (actually  affine)  in 
the  design  parameter  w is). 

<.  CoroUery.  The  feedback  system  gains  which  result  from  the  use  of  the 
compensator  of  the  stabilization  theorem  take  the  form 


[■*»,*,  M 

-  -  »,(*)*,(«)  1 

LAj^U) 

A*j*j  <*)]  [  -  *  ipWdplt) 

"(s)*p(t)Spti)  +vpt,s)4pis)  J 

and 

rs-iw 

r  -wU)kJ(s)  +*pls)dpls) 

"(s)*p(s)Sp{t)  -*#<r)»,(s)'] 

Ayj(s)  J  L-  »{s)*pls)4pls)  +  Hp[t)  npU) 

W(s)tyi)  +»pU)npi*)  J 

Proof.  These  relationships  result  immediately  upon  substituting  2.30  and 
the  expressions  for  ne(s)  and  de{s)  of  the  theorem  into  equations  2.9  and 
2.10. 

The  theorem  gives  a  complete  parameterization  of  the  stabilizing 
compensators  for  our  feedback  system  modulo  the  requirement  that 
w(s)/t,(j)  +  u,(s)  not  be  identically  zero.  Needless  to  say,  in  our  single¬ 
variate  case  this  requirement  is  trivially  verified.  Moreover, 
w(s)np(s)  +  vp(s)  is  always  non-zero  for  some  w(s)  hence  the  existence  of 
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a  stabilizing  compensator  for  any  single-variate  plant  is  guaranteed. 

7.  Corollary.  Every  single-variate  plant  admits  a  stabilizing  compensator. 
Proof:  Consistent  with  the  theorem  it  suffices  to  verify  the  existence  of  a 
stable  w(s)  such  that  w(s)np(s)  +  vp(s)  is  not  identically  zero.  Indeed, 
either  w(s)  ■  - 1  or  w(s)  ■  0  suffices.  If  w(s)  ■  -  1  fails,  i.e., 

de(s)  »  -np(s)  +  vp{s)  ■  0  (241) 

then  the  coprimeness  equality 

1  -  up{s)np(s)  +  vp(s)dp(s)  ■  Ii^(r)  +dp(s))vp(s)  (242) 

implies  that  vp(s)  is  miniphase,  since  [up(s)  +  dp(s)  ]  is  a  stable  inverse  for 
vp{s),  in  which  case 

dc(s)  -  (0]n^(s)  +  vp(s)  m  Vp(s)  (243) 

is  not  identically  zero. 

Note  that  the  above  result  is  contingent  on  the  existence  of  a  coprime  frac¬ 
tional  representation  for  the  plant  and  therefore  may  fail  in  the  various 
generalized  settings  to  which  the  theory  can  be  extended  [8].  It  does, 
however,  hold  in  the  multivariate  case  wherein  a  coprime  fractional  repre¬ 
sentation  is  also  assured  to  exist  [11  ]. 

Occasionally,  one  desires  to  design  a  compensator  which  is  a  proper  ra¬ 
tional  function:  i.e.,  c(oo)  <  »;  rather  than  simply  asking  for  a  stabilizing 
compensator  [2,  II].  Now,  c(s)  *»  ne(s)/de(s)  is  coprime  via  Equation 
2.30  hence  ne(»)  and  de(a>)  are  not  both  simultaneously  zero.  On  the 
other  hand  nr(o»)  <  00  since  ne(s)  is  stable  hence 

c(-)  -  <  <*  (240 

de(») 

if  and  only  if  rff(  oo)  *  w(oo)/i  (®)  +  «/,(•)  ft  0.  Of  course,  in  this  case 
w(s)np(s)  +  vp(s)  is  not  identically  zero  showing  that  the  proper  stabiliz¬ 
ing  compensators  take  the  form 


c(r) 

where  w(j)  is  stable  and 


-w(s)dp{s)  +up(s) 
w(s)np(s)  +vp(s) 


(245) 


w[ee)np{ot)  + 1^(»)  MO  (240 

We  may  now  consider  two  cases.  First,  if  the  plant  is  strictly  proper, 
p( oo)  «  0,  then  np{<»)  *  0  and  since  np(s)  and  vp{s)  are  coprime  via  2. 15 
implies  that  vp(»)  ft  0.  As  such,  2.36  is  satisfied  for  all  stable  w(s).  On 
the  other  hand,  if  the  plant  is  not  strictly  proper,  p ( ao )  #  0,  then  np{»)  * 
0  in  which  case  2.36  reduces  to  w(«*)  *  -  vp(*)/np(a).  We  have  thus 
verified  the  following  corollaries  [10,  11]. 


S.  Corollary.  If  p(s)  is  strictly  proper  then  the  set  of  compensators  given 
by  the  theorem  are  all  proper  and  well  defined  for  every  stable  w(i). 

9.  Corollary.  If  p(s)  is  not  strictly  proper  then  the  compensators  given  by 
the  theorem  are  well  defined  and  proper  if  and  only  if 


*(•>  »» 


-».(•) 


*,(•) 


Finally,  rather  than  simply  looking  for  a  proper  compensator  we  may 
desire  to  design  a  stable  compensator  [23 } .  Although  such  a  compensator 
does  not,  in  general,  exist,  a  criterion  for  the  existence  of  a  stable  stabilizing 
compensator  and  an  algorithm  for  its  construction  is  given  in  Section  10  as  a 
corollary  to  the  simultaneous  stabilization  theorem.  The  result  is,  however, 
far  from  elementary  and  no  parameterization  of  the  space  of  such  compen¬ 
sators  is  known  [17], 

10.  Example.  For  the  plant  of  Example  4  with  the  coprune  fractional 
representation  of  of  Equations  1.12  and  1.13  the  required  set  of  stabilizing 
compensators  take  the  form 


els)  ■- 


is- 2)  16 

-wis)  -  +  - 

L  d+2)  J  l  3  J 

r  <1+ 1)  f  is* 2/3) 

.  (x+2)2  J  l  (1+2)  . 


c(»)  m  -w(m)  +  —  <  <m  <2Jg) 

verifying  that  the  resultant  compensator  is,  indeed,  proper  given  a  strictly 
proper  plant. 

Now,  let  us  repeat  the  above  example  using  the  alternative  coprime  frac¬ 
tional  representation  of  Equation  1.14  and  1.13  which  yields 


els)  - 


-  *<i) 


’(i-2)(i+l)‘ 

1  r>«-2>i 

.  (»♦  212  . 

]  L  3U+1) 

_J_ 

’  (*♦»)*  1 
wls)  - -  t 

.  d+2)1  J 

’  (1+2/3)  1 

.  (i+D  J 

■  (4+ 1)2  ir 

(1-2)  I 

'  16 

.  <l  +  2>*  J  L 

(1+2)  J 
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where 


-  w'  (s) 


'  (5-2)  ■ 

'  16  ' 

.  (r+2)  . 

L  3 

(s) 


'  (5+1)  ' 

'  U  +  2/3)  * 

.  U+2?  . 

(2J9) 


w'(J)  ■  w(s) 


"(5  +  1?' 

,(s+2j*. 


0.40) 


As  such,  the  same  set  of  compensators  are  obtained  from  the  alternative 
coprime  fractional  representation  as  from  the  original  representation 
though  the  parameterizations  defined  differ  by  the  miniphase  factor 
((s+l?]/{(s+2?]. 

Finally,  let  us  set  w'  (s)  «  0  in  2.39  obtaining  the  compensator 


tf(s) 


16(5+2) 
3(5+ 2/3) 


(2.41) 


Now,  r(5)  has  a  zero  at  s  *  -  2  which  cancels  the  pole  of  p(s)  at  s  *  -  2. 
This  does  not,  however,  contradict  the  requirement  that  p{s)c{s)  ** 
[np{s)nc(s))/[dp(s)de(s)]  be  coprime  since  our  coprimeness  concept 
only  forbids  right  half-plane  pole-zero  cancellations.  Of  course,  a  left  half¬ 
plane  pole-zero  cancellation  such  as  encountered  in  the  above  example  is 
benign  and  need  not  be  forbidden. 


3.  Tracking 

Once  we  have  stabilized  our  feedback  system  we  may  use  the  remaining 
design  latitude,  the  choice  of  a  stable  w(s),  to  meet  various  system  design 
constraints.  The  first  of  several  such  design  constraints  which  we  will  con¬ 
sider  are  the  asymptotic  tracking  and  disturbance  rejection  conditions 
wherein  we  require  that  the  system  asymptotically  follow  or  reject  a 
prescribed  input  (4]. 

In  the  tracking  (or  asymptotic  regulator)  problem  it  is  desired  to  design  a 
stable  feedback  system  whose  output,  v2,  asymptotically  follows  a  pre¬ 
scribed  input  which  we  model  by  the  impulse  response  of  a  transfer  function 
f(s),  as  illustrated  in  Figure  2.  As  usual  we  assume  that  r(s)  admits  a 


Figure  2.  Feedback  system  with  tracking  generator. 
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coprime  fractional  representation  in  the  form 

as ) 


*,(*) 


where  there  exist  stable  u,(s)  and  v,(s)  such  that 
ut(s)nt(s)  +  v,(s)d,{s)  -  1 
We  say  that  the  system  tracks  the  impulse 


of  t(s)  if 


(3.1) 

09 


tis) -h^t(s)t(s)  m  A,1<tt (s)t(s)  (3.3) 

is  stable.  Recall  that  the  impulse  response  of  a  .single-variate  system  is 
asymptotic  to  zero  if  and  only  if  the  corresponding  transfer  function  is 
stable.  Thus  the  response  of  our  system  to  the  impulse  response  of  t(s)  will 
be  asymptotic  to  the  impulse  response  of  Ms)  if  and  only  if  the  transfer  of 
equation  3.2  is  stable. 

Recall  from  Corollary  6  that  ~ 

*,,„,(*)  •  te U)ttp(s)dp(s)+op(s)dp(s)  0.4) 

bcnce  if  we  desire  to  stabilize  the  system  and  simultaneously  cause  it  to  track 
the  impulse  response  of  t(s)  we  must  choose  a  stable  w(s)  such  that 
*<*)»,(*)  +  vp(s)  is  not  identically  zero  and 

[»{s)nAs)d  As)  +  0  (s)d  (s)  ] n,(s) 

Vi  <*>'<*>  *  - '  '■  ' - —  »» 

is  stable. 


11.  Property.  Given  p(j)  there  exists  a  compensator  for  the  feedback 
system  of  Figure  2  which  stabilizes  the  system  and  simultaneously  causes  it 
to  track  the  impuslse  response  of  t{s)  if  and  only  if  the  equation 

w(s)np(s)dp(s)  +x(s)dtO)  -  *,(*)*,(*)  -1 

admits  suble  solutions  w(s)  and  x(s)  such  that  w(s)np(s)  +  vp(s)  is  not 
identically  zero.  In  this  case  the  required  compensator  takes  the  form 

l  -  w(s)dAs)  +  u.(s)  J 

C(l)  ■  ■  r  ■■  *  i 

lw(s)«#(s)  +  e,(s)J 

where  w(s)  is  a  solution  to  the  above  equation. 

Proof.  If  there  exists  a  stable  w(s)  such  that  3.3  is  stable  then  it  follows 
from  3.2  that 


lw{s)n,is)dp{s)  +v,[s)dp{s)\ 
d,(s) 
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[  "(s)np{s)dp(s)  +  vp(s)dp(s)\  (u, («)*,(*)  +  v,(s)dt(s)  ] 

7Js) 


(3.0 


'  I  w(s)np(s  )dp(  s)  +  vp(*)dp(t)  }*,(*) 
.  rffU) 


*,<*)  +  lwU)"p(s)dp(s)  +  vp(s)dp{s)  1  o,(j) 


-  —x(s) 

is  suble  since  it  is  expressed  as  the  sum  of  products  of  stable  functions.  Re¬ 
arranging  3.6  and  invoking  2.13  then  yields 

W(s)np(s)dp(s)  +x(s)d,(s)  -  -trp{s)dp{s)  «  up{s)np(.s)  -  1  (3.7) 

as  required.  Conversely,  if  3.7  admits  stable  solutions,  w(s)  and  x(s), 
where  w(s)np(s)  +  vp(s)  is  not  identically  zero  we  define  c(s)  by 

(-w  (s)dp{s)  +upls)) 

[  w(s)np  (s)  +vp(s)l 

using  the  w(s)  of  3.7.  Now,  with  this  w(j)  3.7  and  3.3  imply  That 


c{s) 


(3.*) 


Vi  <*>'('> 

[-jr(s)d,(j)]n,(s) 


[w(j)  njlp(s)  +  vp(s)dp(s))nt(s) 

7J7) 


(3.0 


d'(sj 


-x{s)nt{s) 


is  stable.  Since  the  subilization  theorem  implies  that  any  c(s)  in  the  form  of 
3.8  stabilizes  the  system  while  3.9  implies  that-h,|M](j)r(s)  is  suble  for  this 
choice  of  w  (s)  we  have  constructed  the  desired  compensator. 

Tracking  Theorem:  Given  p(s)  there  exists  a  compensator  for  the  feed¬ 
back  system  of  Figure  2  which  stabilizes  the  system  and  simultaneously 
causes  it  to  track  the  impulse  response  of  t{s)  if  and  only  if  n As)  and 
d,  (s)  are  coprime.  In  this  case  let  w„(s)  and  t^,(s)  be  stable  functions 
such  that 

ii„(*)fi,(s)  +vpr(s)d,(s)  «  1 

and  let  a(s )  *  ng(s)/dg(s )  be  a  coprime  fractional  representation  of 
a(s)  *  dp(s)/d,(s).  Then  the  desired  set  of  compensators  take  the  form 

[  -  w(r)<yr)  +  up(s)] 

C(S)  *  l»U)np{s)  +vp{s)] 

where 

*V(J) 


-upl{s)vp[s)  +e{s)de(s) 
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with  e(s)  an  arbitrary  stable  funaion  such  that  w(s)np{s)  +  vp(s)  is  not 
identically  zero. 

Proof.  Consistent  with  Property  11.  it  suffices  to  show  that  the  equation 

*is)np(s)dp(s)  *xis)d,(s)  -  up{s)np(s)  -  1  O.M) 

admits  stable  solutions  w(s)  and  x(s)  such  that  *{s)np(s)  +  v  As)  is  not 
identically  zero  if  and  oniyif  np(s)  and  dt(s)  are  coprime.  In  this  case  we 
then  show  that  the  appropriate  w(s)  takes  the  form 

w (#)  ■  -um(s)vp(s)  +e{s)d'{s)  0.11) 

for  any  stable  e(s) . 

If  3.10  admits  stable  solutions  w(s)  and  x{s)  then  it  follows  from  3.10 


that 

•t/s)xpW  -  w{s)itp(s)dp(s)  -x(s)d,(s)  -  1 

0.12) 

or  equivalently 

[up(s)  -»{s)dp(s)\np{s)  ♦  l-x(r)ld,(r)  -  1 

0.13) 

showing  that  np{s)  and  d,(s)  are  coprime.  Conversely,  if  np(s)  and  d,(s)  are 
coprime  there  exists  u^(s)  and  i /^(s)  such  that 

«p,(r)«p(s) +Vp,(s)d,(s)  •  1  _ 

0.14) 

from  which  it  follows  that 
up(s)np{s)-l  •  -vp{s)dp(s) 


«  -  [*tm(s)np{s)  +  vm(s)d,(s)vp(s)  )vp(s)dp{$)  0.15) 

■  ( -Mp,  (*)*>(#)  \np(s)dp(s)  +  [-vp,(s)vp{s)dp(s)]d,(s) 


As  such. 

wP{s)  -  -»p(r)j;(») 

0.10 

and 

xUs)  ■  -v,,(j)v,(r)d,(r) 

0.17) 

represent  particular  solutions  to  3.10. 

To  construct  homogeneous  solutions  3.10  we  define  a  transfer  function 
a(s)  by  a{s)  *  dp(s)/d,(s)  and  let 

nAs) 

‘u)  '  TFT 

0.18) 

be  a  coprime  fractional  representation  for  a(s).  It  then  follows  from  Pro 
perry  2  that  there  exists  a  stable  k(s)  such  that 


22k 


*  •  a  « 


Cv'.-Vv'v'v:- 


•_v. 
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and 


dp(s )  *  na(s)k(s) 


(3.19) 


d,(s)  *  da(s)k(s)  (3.20) 

Thus  if  we  define  candidates  for  the  homogeneous  solutionion  of  3.10  by 

w*(s)  -  e(s)da(s)  (3.21) 

and 


x*(j)  *  -e(s)np(s)na{s)  (3.22) 

where  e(s)  is  an  arbitrary  stable  rational  function  we  have 
wk(s)np[s)dp(s)  +x*(j)d,(j) 

(3.23) 

■  e(s)da(s)/ip(s)na(s)f(s)  -e(s)np(s)na(s)de(s)k(s)  «  0 
verifying  that  3.21  and  3.22  are,  indeed,  homogeneous  solutions. 

To  complete  the  solution  of  3.10  we  must  show  that  all  homogeneous 
solutions  are  of  the  form  3.21  and  3.22.  To  this  end  assume  that  wA(s)  and 
ka(s)  are  stable  and  satisfy 


h. M(s)np(s)dp(s )  +xk(s)dl(s)  ■  0  - 

(3.24) 

and  define  e(s)  by 

e(s)  *  **(f)da(s) 

(3.25) 

Clearly, 

w*(r)  *  e(s)da(s) 

(3.26) 

while  it  follows  from  3.24  that 


x*U) 


-wh(s)np(s)dp(s) 

d,(s) 


-wf'(s)npa(s) 


(3.27) 


-*» ’!'(s)nAs)na(s) 

*  - JjJ) -  *  -e(s)np{s)na(s) 

showing  that  wk{s)  and  x*(s)  have  the  form  of  3.21  and  3.22.  It  remains, 
however,  to  verify  that  e(s)  is  stable  for  which  purpose  we  invoke  equations 
3.14  and  the  coprimeness  of  na(s)  and  da(s)  from  which  it  follows  that 
there  exists  stable  ue(s)  and  va(s)  such  that 

uals)na(s)  +  va(s)da(s)  =  1 
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as  such, 

W*(S)  I  W*U)1 

'is' '  '  by«j  ♦».<*><.«>  i 


w*(s)«,(s)«#(,> 


+  w*<s)i/#(r) 


,  ,  W*(i)u As)d As) 

+  w*(f)u#(J)  ■  - d  ^  - —  +  w*(r)va(f) 


’w*(s)ut 

’  L  *, 


{«p,  +  v0(s)d,is) )  +  w*(*)»,(r) 


w*(j)k  (f)rf  (#)m  (*)n  (j) 

■  = - i — tft(s)  * - 2 —  +  —WuaM*pls)vi*W  +  **(*>*',(*> 

«  -x*(r)u#(*)u^(s)  +  w*(j)u#(#)rfp(f)t^,U)  +  W*(5)W#(I) 

is  suble  since  we  have  expressed  it  as  the  sum  of  products  of  stable  func¬ 
tions.  Note,  the  last  equality  in  3.29  follows  from  3.24. 

The  solution  space  for  Equation  3.10  thus  takes  the  form 

w{s)  «  -vpf(x)t//>(j) +*(x)<f#(r)  (3-36) 

and 

x(x)  >  ~ypl{s)vp(s)dp(s)  -t(s)np{s)nt(s)  (3.31) 

As  such.  Property  11  implies  that  if  the  w(s)  of  Equation  3.30  is  used  to 
define  a  stabilizing  compensator  as  per  the  stabilization  theorem  it  will  also 
cause  the  system  to  track  the  impulse  response  of  T(s) .  Of  course,  we  must 
also  assume  that  e(s)  is  chosen  so  that  *>(s)np(s)  +  v As)  is  not  identically 
zero.  To  complete  our  proof  that  the  coprimeness  of  np(s)  and  d,(s)  is  a 
sufficient  condition  for  the  solution  of  the  tracking  problem  it  thus  suffices 
to  show  that  there  exists  at  least  one  choice  of  e(s)  such  that 
w(s)np(s)  +  vp(s)  is  not  identically  zero.  From  3.30  it  follows  that 

w(t)np(s)  +vp(s)  ■  [-^(»)^U)  +  +  «(i)rf,(t)»;(f)  (3.32) 

Now,  dt{s)  is  not  identically  zero  since  it  is  the  denominator  of  a(s).  As 
such,  if  np(s)  is  not  identically  zero  it  follows  from  3.32  that  the  value  of 
"{s)np(s)  +i ip(s)  is  a  non-trivial  function  of  e{s)  and  is  therefore  not 
identical  to  zero  for  all  e(s).  On  the  other  hand  if  np(s)  ■  0  then  2.15  im¬ 
plies  that  vp(s)  is  miniphase  which,  in  turn,  implies  that  w(s)np{s)  +  t^(s) 
m  vp(s)  is  not  identically  zero.  Our  proof  is  therefore  complete. 

Although  the  proof  of  our  theorem  is  long,  though  elementary,  the  basic 
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result  to  the  effect  that  a  compensator  exists  which  will  simultaneously 
stabilize  the  system  and  cause  it  to  track  the  impulse  of  t(s)  if  and  only  if 
rtp(s)  and  d,(s)  are  coprime  is  simple  to  check  (no  common  right  half-plane 
zeros).  Moreover,  the  construction  of  the  required  compensator  is  simply  a 
matter  of  substitution  as  per  the  following  example. 

12.  Example.  Consider  the  problem  of  designing  a  compensator  for  the 
plant  of  Examples  4  and  10  so  that  the  system  is  stable  and  asymptotically 
tracks  a  step  function.  Recall  that 


P(s) 


(s  +  1) 


(j2-4) 


(s+1) 


(s  +  2)2 


where 


(s-  2) 
(s  +  2) 


"pis) 

dpls) 


(3.33) 
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"(s +  2/3)" 

(s— 2) 

• 

3 

,(s  +  2)2_ 

(s  +  2) 

(s  +  2) 

tup(s)np(s)  +  vp(s)dp(s) 


1  (3.34) 


While  we  take  t(s)  =  1/s  from  which  we  obtain 


t(s)  =  - 
s 


1 


(s  +  2) 


r 


(s  +  2) 


«,(s) 

d^s) 


(3.35) 


Now,  n  is)  has  no  right  half-plane  zeros  while  d,(s)  has  a  right  half-plane 
zero  at  s  =  0.  As  such,  np(s)  and  d,(s)  are  coprime  and  according  to  the 
theorem  the  desired  compensator  exists.  Indeed, 


(s+1) 

(s+2)2 


(s+2) 


s 

(s  +  2) 


upt(s)np(S)  +vpt(s)d,(s)  -  1  (3.36) 


As  a  final  step  in  the  construction  of  our  compensator  we  let 

(s-2) 


a(s) 


which  is  coprime  since 


dpls) 

d,(s) 


(s  +  2) 


(s  +  2) 
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dAs) 
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w  (s+2) 

(s+2) 

vff(s)na(s)  +va(s)da(s) 


(3.38) 
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It  thus  follows  from  the  theorem  that  the  desired  w(s)  takes  the  form 


w(s)  *  -u  is)  v As)  +  e(s)d.is) 


r  -  4(j+2/3)  s 

— 1 - !  +  -  e{s)  (3.39) 

(r  +  2)  J  L(s+2)„ 


where  e(s)  is  an  arbitrary  stable  function.  Substitution  of  any  such 
w(s)  into 

(-w(s)d As)+u As)  l 

e(s)  m  - 2 - 1 -  0.40) 

[*(*)«,(*) +«,(«)] 

thus  defines  the  required  compensator  so  long  as  e(s)  is  chosen  so  that 
w(s)np(s)  +  vpis)  is  not  identically  zero.  To  verify  our  solution  we  may 
substitute  w(s)  into  the  formula  for  A,,*,  (s)  of  Corollary  6  obtaining 


^«i#i  W 


— ~  (J(f- 
|_(r+2)4_ 


2)(f+2/3)  +  (s+l)(j-2)e(f)J 


Since  A.t»t  (s)  has  a  zero  at  s  =  0  the  required  tracking  property  may  now  be 
verified  by  the  final  value  theorem. 

Now  let  us  consider  an  alternative  problem  where  we  are  required  to 
track  e*U  (/).  Here  our  tracking  generator  is  defined  by 


1 

Li-1  .  <f+2>J  . 

Lu-2)J  (r— 2)1  d,U 


As  before  np(s)  and  d,is)  are  coprime  since 
-6]  +  [  ——l  L— H  -  U  is) n As)  +v  (s)d,(s)  -  1  (3.43) 

1.3.1  lu+2)*J  L  (J+2)  J  Lu  +  2)J  *  '  *  ' 


Finally,  for  this  example  we  have 


(J~2)' 

rfp<*>  _  1  _  "«<■ 

d,(s)  "  f  (J-2)  "7  *  d,( 
1(4+2). 


where 


[0]  [1]  +  mil]  -  U'[s)nt{s)  +  vt(s)dt(s)  «  I  (3.45) 

Note  that  in  this  example  dpis)  and  d,(s)  are  not  coprime  since  they  have  a 
common  right  half-plane  zero  at  s  *  2.  For  our  purposes,  however,  all  that 
is  required  is  a  coprime  fractional  representation  for  a  is)  =  d0  is)  Id,  is)  as 
constructed  above.  Using  these  new  values  for  up,is)  and  dt(s)  we  obtain 
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for  which  the  zero  at  s  *  2  indicates  tracking.  Note  that  every  stable  w(s)  is 
obtained  for  some  e(s)  and  hence  all  stabilizing  compensators  track 
(0  in  this  example. 


4.  Disturbance  rejection 

There  are  two  alternative  disturbance  rejection  problems  which  arise 
naturally  in  our  feedback  system  theory.  Figure  3a  indicates  the  configura¬ 
tion  for  the  input  disturbance  rejection  problem  [16]  wherein  we  desire  to 
design  a  compensator  which  simultaneously  stabilizes  the  system  and  causes 
it  to  asymptotically  reject  the  impulse  response  of  r(s ),  i.e.,  the  response  of 
the  system  to  the  impulse  response  of  r(s)  should  be  asymptotic  to  zero. 


Figure  3.  Feedback  system  configuration  for  a)  the  input  disturbance  rejection  prob- 
blem,  b)  the  output  disturbance  rejection  problem,  and  c)  a  modified  configuration 
for  the  output  disturbance  rejection  problem. 


A  similar  output  disturbance  rejection  problem  [4]  is  illustrated  in 
Figure  3b.  Here,  the  distrubance  is  injected  into  the  system  at  the  plant  out* 
put  and,  as  before,  it  is  desired  to  design  a  compensator  which  simul¬ 
taneously  stabilizes  the  system  and  causes  it  to  asymptotically  reject  the  im¬ 
pulse  response  of  r(s) .  Surprisingly,  however,  the  output  disturbance  rejec¬ 
tion  problem  is  completely  equivalent  to  the  tracking  problem  considred  in 
the  previous  section.  To  see  this  simply  observe  that  the  block  diagram  of 
Figure  3c  is  equivalent  to  that  of  Figure  3b.  As  such,  if  we  design  a  compen¬ 
sator  to  stabilize  the  system  and  cause  the  planr  output  to  asymptotically 
track  the  impulse  response  of  —  r(s)  when  the  impulse  response  of  r(s )  is 
added  to  the  plant  output  the  total  effect  of  the  disturbance  observed  at  *, 
will  be  asymptotic  to  zero.  Consistent  with  the  above  we  give  no  further 
consideration  to  the  output  disturbance  rejection  problem  since  it  may  be 
resolved  via  the  techniques  of  the  previous  section  with  r(r)  =  —  r( s) .  In¬ 
deed,  one  can  solve  the  tracking  and  output  rejection  problem 
simultaneously  by  working  with  tracking  generator  r(s)  -r(s). 

For  the  input  disturbance  rejection  problem  we  require  that  the  impulse 
response  of 

AVj(s)r(f)  ■  l-w(s)np(s)dp(s)  +  up(s)np(s)  ]r(t)  (4.1) 

be  asymptotic  to  zero.  Hence  to  simultaneously  stabilize  the  feedback 
system  and  cause  it  to  asymptotically  reject  the  impulse  response  of  r(r)  we 
must  choose  a  stable  w(s)  such  that  w(s)np(s)  +op{s)  is  not  identically 
zero  and  (s)r(j)  is  stable.  The  required  theory  [10]  is  essentially  iden¬ 
tical  to  that  used  to  solve  the  tracking  problem  and  hence  we  simply  state 
the  pertinent  theorems  without  proof.  For  this  we  let  r(s)  »  nr(s)  / dr(s )  be 
a  coprime  fractional  representation  for  r(s). 

13.  Property.  Given  p(s)  there  exists  a  compensator  for  the  feedback 
system  of  Figure  3a  which  stabilizes  the  system  and  simultaneously  causes  it 
to  reject  the  impulse  response  of  r(s)  if  and  only  if  the  equation 

w(s)npis)dpls)  +  y(s)dr{s)  -  up{s)np(s) 

admits  stable  solutions  w{s)  and  y(s)  such  that  w(s)nfi(s)  +  vp(s)  is  not- 
identically  zero.  In  this  case  the  required  compensator  takes  the  form 

f-w(r)d  <*)+*  (*)] 
c(s)  ■  — - - 2 - £ - 

where  w(r)  is  a  solution  to  the  above  equation. 

Disturbance  Rejection  Theorem.  Given  p(s)  there  exists  a  compen¬ 
sator  for  the  feedback  system  of  Figure  3a  which  stabilizes  the  system 
and  simultaneously  causes  it  to  reject  the  impulse  response  of  r(r)  if  and 
only  if  dp(s)  and  dr(s)  ire  coprime.  In  this  case  let  u^s)  and  v^is) 
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be  stable  functions  such  that 

upf(s)dp(s)  +  i ipr(s)drls)  m  i 

and  let  b(s)  »  nb(s)/db(s)  be  a  coprime  fractional  representation  of 
b(s)  *  np(s)  /dr(s) .  Then  the  desired  set  of  compensators  take  the  form 

f  -  *(s)d As)  +  u As)  1 

c(s)  »  — - £ - £ - 

(w(j)n(j)  +t»(s)J 


where 


w(s)  «  uprls)up(s)  +  f(s)db(s) 


with/(s)  an  arbitrary  stable  function  such  that  w{s)np{s)  +  vp(s)  is  not 
identically  zero.  ' 

14.  Example.  Continuing  with  the  plant  of  Example  12  let  us  consider  the 
problem  of  designing  a  compensator  to  reject  a  step  function,  i.e.,  we  let 


’ll  _  .  <*+2)]  »,< 


-‘J  (7“  +  2j  (777}  -V^w+vW)-i  («) 


showing  that  dp(s)  and  dr(s)  are  coprime.  Finally,  we  let 

'  U+1)  ' 

"p(s)  L(i  +  2)2J  n»(s) 


which  is  clearly  coprime.  From  the  theorem  the  required  w(s)  take  the  form 


w-(s)  *  upr(s)up(s)  +/(s)db(s)  »  -  + 


where /(s)  is  arbitrary  stable  function  such  that  w(s)/tp(s)  +  vp(s)  is  not 
identically  zero.  The  use  of  the  compensator  derived  from  this  w(s)  then 
results  in  the  gains 
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-  <J~2)  I  [(s  +  2X3rJ-8#-l2)  »  3r(i+l)/(r)l  (4.7) 

U(J  +  2)4J 

Here,  the  fact  that  ( s )  has  a  zero  at  s  ■  0  indicates  that  the  disturbance 

rejection  specification  has  been  achieved  while  the  fact  that  ( s )  has  a 

zero  at  s  »  2  implies  that  the  system  also  tracks  e^U  (r).This  «  consistent 
with  Example  12  for  the  system  tracks  e^Ufr). 


S.  Simultaneous  tracking  tad  disturbance  rejectkMi 

The  purpose  of  this  section  is  to  combine  the  results  of  the  previous  two  sec* 
tions  by  formulating  criteria  for  the  design  of  a  compensator  which 
simultaneously  stabilizes  the  systegi,  causes  it  to  track  the  impulse  response 
of  t(s)  and  causes  it  to  reject  the  impulse  response  of  r(s)  {16].  The  ap¬ 
propriate  feedback  system  configuration  is  shown  in  Figure  4  where  r(s)  is 
taken  to  be  an  input  disturbance.  Of  course,  an  output  disturbance  can  also 
be  included  in  the  theory  simply  by  combining  it  with  the  tracking 
generator. 


tft) 

L_tJ 

ZjsJ 

r—  ■■ 

eft) 

ri 

_ 

Figen  4.  Configuration  for  the  simultaneous  tracking  and  disturbance  rejection 
problem. 

For  consistency  with  the  previous  sections  we  will  use  the  same  notation 
which  is  reviewed  as  follows.  Our  plant  is  assumed  to  have  a  coprime  frac¬ 
tional  represemion 

n  As) 

OlS)  m  —1 -  (S.l) 


such  that 

up(s)np(t)  +up(s)dp(s)  «  1  (5.2) 

while  the  tracking  and  rejection  generators  are  characterized  by  coprime 
fractional  representations 
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and 


r(s) 


nr(s ) 
d,(s) 


Also  as  in  the  previous  sections  we  define  a(s )  and  b(s)  by 

nt(s) 


and 


o(s) 


bis) 


dp(s ) 
d,(s) 


dpis) 

dris) 


dAs) 


(5.4) 


(5.5) 


(5.6) 


ntis) 
d,is) 

nb(s)/db(s)  are  coprime  in  the 


where  a(s)  -  na(s)/nt(s)  and  b(s) 
sense  that  there  exists  stable  ue(s),  va(s),  ub(s),  and  vb(s)  such  that 

uais)na(s)+va(s)  da(s)  m  l 


(5.7) 


and 


ubis)nbis)+vbis)dbis)  -  1  (5.*) 


Moreover  it  follows  from  5.5  and  5.6  together  with  property  2  that  there 
exists  stable  k(s)  and  mis)  such  that 

d,is)  *  dais)kis)  and  dp(s)  -  n#(s)*(s)  ”  (5.9) 

and 


dris)  ■  dbis)mis)  and  npis)  «  nbis)m(s)  (5.10) 

Finally,  the  coprimeness  conditions  for  the  tracking  and  disturbance  rejec¬ 
tion  problems  are  characterized  by 

upiis)npis) +vplis)d,is)  *  l  (5.11) 

and 


upris)dpis)  +vpris)dr[s)  -  1  (5.12) 

while  we  will  also  require  a  coprimeness  condition  between  d,{s)  and  dr(s) 
which  we  characterize  by  the  equation 

utris)d,is)  +v,ris)dris)  -  I  (5.13) 

With  this  review  of  notation  in  hand  the  required  design  equations  for 
the  simultaneous  tracking  and  disturbance  rejection  problem  can  be  ob¬ 
tained  simply  by  combining  the  results  of  Property  11  and  Property  13  and 
observing  that  both  design  equations  must  be  satisfied  by  the  same  w(s) 
since  we  desire  to  construct  a  single  compensator  which  simultaneously 
solves  both  problems. 
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15.  Property.  Given  p(s)  there  exists  a  compensator  for  the  feedback 
system  of  Figure  4  which  stabilizes  the  system,  causes  it  to  track  the  impulse 
response  of  t(s)  and  simultaneously  causes  it  to  reject  the  impulse  response 
of  r(s)  if  and  only  if  the  pair  of  equations 

w(s)*,<s)d,(s)+x<s>d,{r)  ■  up(s)Hp{s)  - 1 
and 


»(s)"pls)dp{s)  +y(r)d,(J)  •up(s)np(s) 

admit  stable  solutions  w(s),  x(s),  and  y(s)  such  that  w(s)np(s)  +vp(s)  is 
not  identically  zero.  In  this  case  the  required  compensator  takes  the  form 

[  -  w(s)dAs)  +*(*)] 

e(s)  m - i - 1 - 

l»is)np(s)  +e,(r)  J 

where  w(s)  is  a  solution  to  the  above  equations. 

StfuHaaeoas  Tracking  and  Disturbance  Rejection  Theorem:  Given 
p(s)  there  exists  a  compensator  for  the  feedback  system  of  Figure  4 
which  stabilizes  the  system,  causes  it  to  track  the  impulse  response  of 
t(s)  and  simultaneously  causes  it  to  reject  the  impulse  response  of  r(s)  if 
and  only  if  • 

(i)  n As)  and  d,(s)  are  coprime, 

0i)  J  As)  and  d,(s)  are  coprime,  and 
(hi)  a,{s)  and  rf,(s)  are  coprime. 

In  that  due  the  desired  set  of  compensators  take  the  form 

l-w(s)dAs)  +  *(*)! 

c(s)  - - - 1 - 

lw<r)«j(j)  +  »,(*)  J 

where 

"(*)  •  (vpr(s)«v(s)«r(s)df(s)  ’-np,{s)vp{s)vp(s)dr(s)\  +f(s)d4(i)d,(s) 

with  g (s)  an  arbitrary  stable  function  such  that  w(s)np(s)  +  (/,(*)  is  not 
identically  zero. 

Proof.  The  fact  that  np{s)  and  d,(s)  must  be  coprime  follows  from  the 
tracking  theorem  while  the  fact  that  dp{s)  and  dr{s)  must  be  coprime  follows 
from  the  disturbance  rejection  theorem.  To  verify  that  d,(s)  and  dr(s)  must 
also  be  coprime  for  simultaneous  stabilization  we  subtract  the  two  design 
equations  of  Property  15  obtaining 

Ly(r)  ]*,<*)  +  l-x(s)]d,(s)  -  1  (5.14) 

Conversely,  to  show  that  the  three  coprimeness  conditions  are  also  suffi¬ 
cient  we  must  construct  a  w(s)  which  simultaneously  satisfies  the  criteria  for 
tracking  and  disturbance  rejections  derived  in  the  preceeding  sections. 
Upon  invoking  the  resalts  of  the  tracking  and  disturbance  rejection 
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!• 


theorems  we  must  therefore  solve 

~upSs)vp(s)  +  e(s)da(s)  *  w(s)  =  upr(s)up(s)  +/(s)db(s)  (5.15) 

for  stable  e(s),f(s),  and  w(s) .  Since  the  required  set  of  stable  w(i')  may  be  ob¬ 
tained  by  substitution  once  e(s)  and  f(s)  have  been  parameterized  our  main 
problem  is  to  characterize  the  stable  solutions  of 

e(s)dt(s)  -f(s)db(s)  *  lupr(s)up(s)  +upl(s)vp(s) )  (5.16) 

To  obtain  a  particular  solution  for  5.16  we  invoke  5.9,  5.10,  and  5.13 
obtaining 

lupf(s)up(s)  +upl(s)vp(s)] 

*  [upr(s)up(s)  +Upl(s)vp(s)  ]  [u,,(j)d,(f)  +  vtr(s)dr(s)  ] 

*  [upr(s)up(s)  +upt(s)upis))lutr(s)k{s)da(s)  +vtr(s)m(s)db(s)} 

*  {[upr(s)up(s)  +  upt(s)vp(s)]ulr(s)k(s)]da(s) 

+  [lupr(s)up(s)  +upt(s)  vp(s)]v,r(s)m(s)]db(s)  (5.17) 

As  such,  the  required  particular  solutions  take  the  form 

e'(s)  «  [upr(s)up(s)  +upt(s)vp(s) \ulr(s)k(s)  (5.18) 

and 

P\s)  *  -  fu^U) ii^i)  +up,U)i>/,(s))u„U)w(j)  (5.19) 

Of  course, 

eh(s)  -  g(s)db(s)  (5.20) 

and 

fHs)  -  g(s)dB(s)  (5.21) 

represent  homogeneous  solutions  to  5.16  for  any  stable  g(s)  since 

e*(s)dBU)  -/*(j)<yj)  »  g[s)db[s)da(s)-g(s)da(s)db(s)  *0  (5.22) 

As  such,  our  characterization  of  the  solution  space  for  5.16  will  be  complete 
if  we  can  verify  that  all  stable  homogeneous  solutions  to  5.16  take  the  form 
of  5.20  and  5.21  for  some  stable  g  (s) .  To  this  end  let_e*(s)  and  fh(s)  be  ar¬ 
bitrary  stable  homogeneous  solutions  to  5.16,  i.e.,  ~ 

±h{s)da(s)  -fh(s)db(s)  =■  0  (5.23) 

Now,  define  g(s)  by  g(s)  =  _eA(s) /db(s)  in  which  case 

_e*(s)  »  g(s )db{s)  (5.24) 
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while  5.23  implies  that 

e*{s)dAs) 

f{s)  »  = - *  g(s)da{s)  (5.25) 

db(s) 

showing  that_e*(r)  and/*(s)  are  of  the  required  form.  It  remains,  however, 
to  show  that  g(s)  is  stable.  Indeed, 


g(s) 


_£*U)  e*(s> 

* - -  — - l*bls)itb(s)  +vb(s)db(s)) 

db{s)  db{s ) 


*jr*(s)ub(s)  + 
•J*(s)vb(s)  + 


.£*(*  )ubls)/tb(s) 


dbM 

- - - (u„(r)d,(r)  +  vu{s)dr{s) ) 

d,U)_ 


•e?(s)vbW  +j*(s)ubls)npis)v„(s)  + 
*  .£*<»)«*(*)  +f'M{s)ub{s)nt(s)vtr{s)  + 


_el,(s)ub(s)np(s)utrls)d,ls) 

_ 

ek(s)ubls)Hb{s)utr{s)d,(s)kis) 
~db(s)  “ 


•**(s)ubU)  +jk(s)ub(s)np(s)vlf(s)  +J*(s)ub(s)nb(i)utr(s)lHs)  (5J« 

showing  that  g(s)  is  subie  since  it  has  been  expressed  as  the  sum  of  prod¬ 
ucts  of  subie  functions.  Here,  Equation  5.26  was  derived  with  the  aid  of 
Equations  5.6,  5.8,  5.9,  5.13,  and  5.23.  As  such,  the  complete  set  of  solu¬ 
tions  to  5.16  take  the  form 


•is)  »  [upK(s)up{,s)  +«,,(*)»,(*)  }«„U)*U)+f(f)dfc(r)  (5.27) 

and 

As)  m  [Upr(s)uf{s)  +«|K(s)eJ,(s)]  u,r(s)m(s)  +g(s)d,(s)  <5-2S> 


i 

*•> 

cS 

kV 

% 


Now,  upon  substituting  either  of  these  expressions  into  5.15  we  obtain  the 
desired  expression  for  w(r)  in  the  form 


"<*>  •  -*V<*)«V(,>  +l 'tPrl*)upW  +up,(s)vp(s)u,r(s)k(s)daU)  +g{s)db(s)dp{s) 

*  upr{s)up[s)u0.{s)d,{s)  +up,(s)up[s)ulr(s)dt{s)  -1+  g(s)db{s)da{s ) 

*  l»prls)up{s)utr{s)d,{s) -upl(s)vp(s)vtr(s)dr{s)]+g[s)db{s)da{s)  (5.29) 
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where  g (s)  is  an  arbitrary  function. 

Finally,  to  complete  the  proof  that  the  three  coprimeness  conditions  suffice 
for  simultaneous  tracking  and  distrubance  rejection  we  must  verify  that  there 
exists  one  choice  of  a  stable  g(s)  such  that  *'(s)nfi(s)  +  vp(s)  is  not  iden¬ 
tically  zero.  Upon  substituting  5.29  into  this  expression  we  obtain 

"(s)np(s)  +  vp(s)  -  lupr(s)upls)utr(s)dlls) -upt(s)op(s)vtr(s)dr{s)  ]np(s) 

(5J0) 

+  t >„{s)  +g(s)db(s)da(s)np(s) 

Now  db(s)  and  dt(s)  are  not  identically  zero  since  they  represent  the 
denominators  for  well  defined  transfer  functions.  As  such,  if  n  As)  is  not 
identically  zero  S.30  will  be  non-trivially  dependent  on  g(s)  and  hence  not 
identically  zero  for  every  choice  of  g(s).  On  the  other  hand  if  np(s)  is  iden¬ 
tically  zero  the  equality  up(s)np(s)  +  vp(s)dp(s)  «  1  implies  that  vp(s)  is 
miniphase  hence 

w(s)np[s)  +  vp(s)  m  vp[s)  (5-31) 

is  not  identically  zero.  Our  proof  is  therefore  complete. 

Although  the  theorem  is  highly  complex  and,  indeed,  is  predicated  on  the 
equally  complex  theorems  which  preceded  it,  the  final  result  is  an  explicit 
description  of  the  desired  family  of  compensators.  Moreover,  the  terms  in 
this  expression  are  readily  computed  by  solving  one  or  more  coprimeness 
equation.  As  such,  the  result  is  easily  implemented  as  per  the  following  ex¬ 
ample. 

16.  Example.  Continuing  our  analysis  of  the  plant  introduced  in  the 
previous  examples  we  will  investigate  the  possibility  of  simultaneously 
tracking  e^Ufr)  and  rejecting  U  (t).  Here, 

(i-2) 

d,(s)  = -  (5.32) 


(s  +  2) 


which  are  clearly  coprime.  Indeed 


(r-2) 


=  ulr(s)d,(s)+v,r(s)dr(s)  -  1  (5.34) 


As  such,  the  required  w(j)  takes  the  form 


— ( j2  +  4-*  +  4)  +  s(s  +  2)g{s)  I 
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Saks  et  ax. 


yielding 

h^As)  -  j2L(,2+.i,.|.i£.  -  (s-2)(s+2)t(s)  (5.3d) 

L<i+2)*J  L  3  3  ■>  J 

and 

*  t  r  " 

L,U)  •  — — 2-  (9^+12^  + 32^-1 12* -144)  +  9»(#+lXr+2)g(r)  (3-37) 

11  |9<*+2)JJ  L  .  J 

which  have  the  required  zeros  at  s  ■  0  and  5*2,  respectively. 
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A  Design  Method  for  Two-Dimensional 
Recursive  Digital  Filters 


JOHN  J.  MURRAY,  member,  ieee 

Abstract— A  method  foe  designing  two-dimendofui,  syiriftietric  half-  t  n 


plane  tecnnive  digital  filter*  is  presented:  a  fitter  is  first  designed  as  a 
parametrized  family  of  one-dimensional  filters;  a  simple  approximation 
it  then  used  to  find  a  rational,  stable,  two-dimensional  filter.  Some 
advantages  and  disadvantages  of  the  method  are  discussed,  and  several 
examples  are  given. 

I.  Introduction 

N  THE  past  few  yean,  several  successful  algorithms  for  the 
design  of  recunive  multidimensional  digital  filters  have  been 
published  (e.g.,  ll]-[8]).  However,  it  is  probably  safe  to  say 
that  nothing  approaching  a  “universal”  design  procedure  has 
been  developed.  Of  course,  it  is  highly  unlikely  that  such  a 
procedure  exists,  in  view  of  the  fact  that  in  the  (much  simpler) 
one-dimensional  situation  it  is  already  clear  that  a  multiplicity 
of  design  methods  is  necessary  to  handle  the  variety  of  prob¬ 
lems  which  arise. 

Although  there  are  relatively  few  two-dimensional  situations 
in  which  recursive  filten  are  required,  there  are  many  in  which 
recunive  filten  may  be  considered  as  an  alternative  to  FIR 
filten.  A  major  drawback  in  the  application  of  recunive 
filten,  however,  has  been  the  complexity  and  expense  of  their 
design,  especially  when  one  is  merely  exploring  the  possibility 
that  they  may  be  more,  efficient  than  FIR  filters.  Indeed, 
there  is  evidence  that  in  many  situations  the  FIR  filters  will  be 
the  more  efficient  (9).  For  these  reasons,  most  designers 
simply  use  FIR  filten  rather  than  go  through  the  nonlinear 
optimization  procedures  usually  necessary  to  design  recursive 
filten  and  to  ensure  stability. 

A  gap  thus  appean  in  the  range  of  design  algorithms;  there 
appear  to  be  few  procedures  which  can  provide  a  relatively 
quick,  suboptimal  design  which  is  guaranteed  to  be  stable. 
This  gap  is  highlighted  by  the  fact  that  in  two-dimensional 
filtering  applications  it  is  relatively  rare  to  encounter  a  situa¬ 
tion  where  it  is  important  to  meet  precise  specifications  of  the 
type  encountered  in  one-dimensional  filtering,  e.g.,  limits  on 
the  passband  ripple  or  the  stopband  attenuation.  It  is  far  more 
common  to  be  given  an  idea  of  the  “shape”  of  the  filter  with¬ 
out  any  numerical  specifications.  The  design  method  described 
in  this  paper  is  directed  towards  this  sit  ition. 

The  basic  element  in  this  method  is  a  recursive  symmetric 
half-plane  filter.  This  is  in  contrast  with  the  asymmetric  half¬ 
plane  filters  designed  in  [4]  and  the  semirecursive  symmetric 
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Fig.  1 .  Support  set  for  symmetric  half-plane  filter. 

half-plane  filters  in  [10].  Preliminary  versions  of  the  present 
method  have  been  described  in  [11]  and  [12] .  It  may  be  out¬ 
lined  as  follows.  Beginning  with  a  frequency  specification  on 
the  torus 

T1  =  {(0 1 , 0*  )|  -  tr  <  0,  <  tr,  -  rr  <  6 2  <  ir) 
one  first  fixes  0lt  and  designs  a  classical  one-dimensional 
recursive  filter  in  the  variable  03.  Doing  this  for  each  B\  gives 
a  two-dimensional  filter  whose  coefficients  are  (in  general 
transcendental)  functions  of  0,.  One  then  approximates  these 
coefficient  functions  by  trigonometric  polynomials  in  0 ,.  The 
result  is  a  rational  symmetric  half-plane  two-dimensional  filter. 
The  major  problem  lies  in  carrying  oUt  the  approximation  in 
such  a  way  that  stability  is  preserved.  We  will  describe  an 
approximation  procedure  which  is  noniterative,  requires  a 
minimum  of  computation,  and  yields  filters  which  can  be 
proved  to  be  stable. 

II.  Symmetric  Half-Plane  Filters 
By  a  recursive  symmetric  half-plane  filter,  we  will  mean  a 
filter  whose  denominator  is  of  the  form 

B(ZUZ3)  =  I  +  f  £  bmnZ?Zl.  (1) 

n  - 1  m  - -M 

The  corresponding  support  set  is  shown  in  Fig.  1 .  One  advan¬ 
tage  of  these  filters  from  the  point  of  view  of  implementation 
is  immediately  clear;  they  are  ideally  suited  for  parallel  pro¬ 
cessing  since  the  output  at  any  point  depends  only  on  outputs 
from  the  previous  lines.  It  is  therefore  possible  to  process 
entire  lines  in  parallel. 

Another  advantage  of  this  class  of  filters  is  that  its  “stability 
set”  is  slightly  smaller  than  that  of  asymmetric  half-plane  filters 
and,  of  course,  considerably  smaller  than  that  of  quarter-plane 
filters.  The  stability  set  for  filters  of  the  form  (I)  is  the  set 
defined  by 

5={(Z„ZJ)eCl||Z,|=]  and  |Z,|  <  1}. 


«  i 1  if  '; *'j  I <!  1  *  i  i  ■  /!  ’•/'  ■  \ 

I  r-  iljl'.it;!'  ;  >1  }•;  >' r 
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Fig.  2.  A  90*  fan  Alter. 


(That  is,  if  the  denominator  (1)  does  not  vanish  on  the  set  5, 
the  corresponding  filter  will  be  stable.)  This  is  an  immediate 
consequence  of  the  corresponding  Tact  for  asymmetric  half¬ 
plane  filters  (4)  since  the  extra  one-dimensional  stability  con¬ 
dition  for  the  asymmetric  filter  is  automatically  satisfied  in 
the  present  case-the  function  in  question  is  identically  1.  We 
also  note  that  stability  of  the  two-dimensional  symmetric 
half-plane  filter  with  transfer  function  equal  to  H(Z% ,  Z2 )  is 
equivalent  to  stability  of  the  entire  family  of  one-dimensional 
filters  (in  Z2)  whose  transfer  functions  are 

H(eit'.Zt),  -#<9,<*. 

This  fact  follows  immediately  from  the  form  of  the  stability 
set  S;  it  will  be  important  in  the  design  procedure  presented 
below. 

Filters  of  the  form  (1),  however,  are  incapable  of  approxi¬ 
mating  a  general  magnitude  response.  The  restrictions  which 
must  be  placed  on  a  magnitude  function  to  enable 

it  to  be  approximated  by  a  stable  symmetric  half-plane  filter 
(with  constant  numerator)  may  be  stated  as  follows.  The 
average  gain  over  the  set  8X  “  constant,  i.eM 


£1> 


*(9„9,)  Mi 


must  be  a  constant  independent  of  9t.  A  simple  direct  proof 
of  this  condition  is  given  in  Appendix  I. 

It  is  clear  from  the  form  of  these  restrictions  that  in  order  to 
approximate  a  general  magnitude  response,  one  must  cascade 
a  symmetric  half-plane  filter  with  a  one-dimensional  filler  in 
Zy .  If  one  realizes  this  latter  filter  as  a  recursive  filter,  the 
cascade  will  be  an  asymmetric  half-plane  filter;  if  the  one- 
dimensional  filter  is  chosen  to  be  a  noncausal  filter,  the  cascade 
will  be  essentially  equivalent  to  a  semirecursive  symmetric  half- 
plane  filter.  In  both  cases,  the  final  filter  is  expressed  as  a 
product,  rather  than  the  more  usual  sum,  of  a  one-dimensional 
filter  and  a  symmetric  half-plane- filter.  (This  is  not  to  imply 
that  an  arbitrary  asymmetric  half-plane  or  semirecursive 
symmetric  half-plane  filter  can  be  factored  in  this  way.  How¬ 
ever,  the  class  of  magnitude  responses  which  can  be  approxi¬ 


mated  by  the  product  form  is  the  same  as  that  associated  with 
the  sum  form.  In  fact,  it  has  been  shown  [13]  that  if  one 
imposes  the  additional  constraint  that  the  magnitude  response 
of  a  rational  asymmetric  half-plane  filter  has  quadrants!  sym¬ 
metry  then  the  filter  can  be  expressed  exactly  as  a  product  of 
the  above  type.) 

One  of  the  advantages  of  working  with  filters  of  the  form 
(1)  is  the  flexibility  discussed  above-by  appropriate  choice  of 
the  one-dimensional  “compensating”  filter  one  can  realize 
either  asymmetric  filters  or  semirecursive  filter,.  A  further 
advantage  is  that  any  symmetries  in  the  specification  of  the 
filter  will  be  reflected  in  symmetries  of  the  coefficients  of  the 
filter.  Of  course,  other  choices  besides  the  two  discussed 
above  are  possible  for  the  one-dimensional  filter;  for  instance, 
one  could  amply  take  it  to  be  an  FIR  filter.  This  is  in  fact 
what  is  done  in  the  examples  presented  below. 

III.  The  Design  Procedure 

For  the  sake  of  clarity  we  will  describe  the  design  proce¬ 
dure  while  simultaneously  applying  it  to  one  particular  filter 
specifics  tion-namely,  the  90°  fan  filter  shown  in  Fig.  2. 

We  assume,  given  a  real  frequency  specification  hffix ,  9,)  and 
we  wish  to  design  a  recursive  filter,  to  approximate  this  specifi¬ 
cation.  We  proceed  as  follows. 

1)  For  each  value  of  9],  we  get  a  one-dimensional  frequency 
specification 

**)- 

For  the  fan  filter,  this  one-dimensional  specification  is  an 
ideal  low-pass  filter  with  a  cutoff  frequency  equal  to  |9j|. 

2)  For  each  value  of  9t ,  we  use  any  of  the  design  procedures 
available  in  one  dimension  to  design  a  one-dimensional  stable 
recursive  filter  to  approximate  )  in  the  form 

n  ak(^l)  *  +*k.l(9 1)£|  p, 

JJ,  .M0.)  +  h*.>(0,)Z:  +bk'A0y)Zl 

i.e.,  we  design  the  one-dimensional  filter  as  a  cascade  of  first- 
and  second-order  sections. 

For  simplicity,  we  will  take  the  bilinear  transformation  of  a 


design  the  compensating  filter  from  the  ideal  characteristic 
in  (3);  one  must  calculate  the  actual  response  of  the  two- 
dimensional  filter,  and  design  the  compensating  filter  di¬ 
rectly  from  this.  Since  it  is  not  our  purpose  to  go  into  one¬ 
dimensional  filter  design,  we  will  not  discuss  this  further. 

Among  the  advantages  of  the  above  procedure  is  the  fact 
that  it  can  use  classical  one-dimensional  filters,  and  so  enables 
some  of  the  intuition  for  and  experience  with  these  to  be 
carried  over  to  the  two-dimensional  case.  For  example,  the 
ripple-free  passbands  and  stopbands  of  the  filters  presented 
in  the  next  section  reflect  the  fact  that  the  design  is  based  on 
one-dimensional  Butterworth  filters. 

A  further  advantage  is  that  the  response  of  the  intermediate 
filter  (which  is  rational  in  Zj  and  transcendental  in  Z,)  is  easily 
calculated,  and  gives  an  upper  bound  on  the  performance  of 
a  filter  with  a  given  Zj-order.  If  this  bound  is  not  adequate, 
one  must  increase  the  Zj-order;  this  is  significant  in  that  in¬ 
creasing  the  Zt -order  linearly  increases  the  computation  needed 
in  implementation,  while  increasing  the  Zj-order  linearly  in¬ 
creases  both  computation  and  storage  requirements.  It  is 
therefore  normally  preferable  to  increase  the  Zx  -order  rather 
than  the  Zj-order,  if  this  is  possible.  In  the  present  procedure, 
when  one  has  decided  on  a  Zj-order,  increasing  the  Zt -order  is 
simply  a  matter  of  computing  more  Fourier  coefficients,  and 
changing  the  window  coefficients-one  does  not  have  to  redo 
the  bulk  of  the  previous  computation.  Thus  the  procedure 
lends  itself  readily  to  interactive  design. 

IV.  Examples 

The  first  example  is  the  fan  filter  designed  in  the  previous 
section.  Fig.  3  shows  the  response  of  the  filter  with  Zi-order  = 
2  and  Zt  -order  ■  <*».  Fig.  4  shows  the  response  of  the  final 
version.  As  calculated  in  the  previous  section,  the  denominator 
array  is 


while  the  numerator  array  is  simply 

000010000 
00002000  0. 

1 

The  symmetry  of  the  fan  filter  is  clearly  reflected  in  the 
symmetry  of  the  denominator  coefficients.  While  the  resulting 
filter  does  not  have  an  impressively  sharp  cutoff,  it  should  be 
remembered  that  the  two-dimensional  part  has  only  five 
distinct  coefficients  other  than  0,  1 ,  and  2,  and  was  designed 
with  a  programmable  calculator  (SR-S2).  Total  computation 
time  was  approximately  17  min  with  a  64-point  quadrature  for 
the  Fourier  coefficients.  A  higher-order  (computer-calculated) 
version  is  shown  in  Fig.  S.  The  computation  was  not  timed, 
but  may  be  judged  from  the  fact  that  the  bulk  of  the  compu¬ 
tation  consists  of  calculating  forty  one-dimensional  Fourier 
coefficients.  A  circularly  symmetric  low-pass  filter  with  cutoff 
frequency  specified  as  ir/2  is  shown  in  Fig.  6:  this  is  a  cascade 


0.0000264  0  0.05252  0  0.2477  0  0.05252  0  0.0000264 
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I 

second-order  low-pass  Butterworth  filter  in  the  present  case. 
This  gives 

_ fafll+Za)1 _ 

♦  VI  cjc  ♦  1  +  2 (cjJ  -  1  )Zj  +  (cJe  -  V2ue  +  1)2! 

!  where 

ue  *  |  tan(dt/2)j. 

3)  We  express  (2)  in  the  form 


jJi  *a(®i)  1  +^*,i(*i)Za  +^k, i(fii)Z\ 


where 

®k,/(^l)  “  flk,l(®l)/*k(®l) 

and  similarly  for  the  denominator. 

For  the  fan  filter,  we  get 

_ u4 _  1  +  22i  +  Z\ 

1  ♦  >/^we  +  wj  1+  c(®i)2j  +  d(0|)Z| 
where 


c(d,)« 


2(<4-l) 

1  ♦  >/Jwe  ♦  <•>* 


(3) 


(4) 


and 


<*•.)- 


i  -  >/Ycjc  +  uj 

i  ♦  >/?w,  ♦  «4 


At  this  stage,  the  first  factor  la  just  a  one-variable  function, 
and  can,  in  principle,  be  approximated  by  a  filter  in  Zj  by 
using  any  suitable  ooe-variable  design  algorithm.  The  second 
factor  is  a  (transcendental)  symmetric  half-plane  filter  expressed 
as  a  cascade  of  second-order  sections  inZj.  Since  each  of  the 
one-dimensional  filters  used  is  stable  by  design,  it  follows  from 
the  observation  made  in  Section  II  that  the  resulting  two- 
dimensional  filter  a  stable,  apart  from  possible  singularities 
which  may  arise  on  the  distinguished  boundary.  These  can  be 
shown  to  cause  no  stability  problems  in  practice.  (In  the 
present  example,  such  singularities  occur  at  0j  -  0,  v;  however, 
they  disappear  far  the  next  step  of  the  design  procedure.) 

4)  The  final  step  in  the  procedure  is  to  approximate  the 
functionsek,j(0|)by  trigonometric  polynomials,  and  to  approx¬ 
imate  the  functions  t)  by  trigonometric  polynomials 
fat(9x)  in  such  a  way  that  stability  is  preserved.  The  latter 
approximation  is  the  only  one  which  presents  any  difficulties. 
It  could  be  tackled  by  nonlinear  optimization  methods,  but 
we  wish  to  use  a  method  which  is  less  demanding  from  a 
computational  point  of  view. 

The  most  obvious  approach  is  to  window  the  Fourier  series 
of  the  functions  bk,/(0i).  It  is  shown  in  Appendix  I  that  the 
resulting  filter  will  be  stable  if  the  trigonometric  kernel  corre¬ 
sponding  to  the  window  used  is  positive  everywhere  and  has 
total  weight  equal  to  one.  However,  for  reasons  discussed  in 
Appendix  II,  this  gives  very  large  errors  close  to  singularities, 
and  the  following  slightly  more  complicated  version  is  used 
instead. 


Let 

+  bkA 
and 

9k  9  ^1  *fa,2  ‘  d«r,» 

and  apply  the  windowing  procedure  to  ctk  and  fa  (again  with  a 
positive  kernel  of  weight  one)  to  obtain  trigonometric  poly¬ 
nomials  ak  and  fa.  Finally,  set 

fa,i  *  i  (a*  +  9  k)  ~  I 
and 

fat  9  J  (®a  *  9k)- 

It  is  shown  in  Appendix  II  that  this  procedure  will  yield  a 
stable  filter  and  can  be  expected  to  give  much  improved  per¬ 
formance  dose  to  singularities.  It  should  be  noted,  however, 
that  this  procedure  approximately  doubles  the  number  of 
coefficients  in  the  filter. 

For  the  fan  filter,  we  have 

4  tan2  {9 1 12) 

tan*  (0t/2)  +  V?  |  tan0|/2|  +  1 
and 

_ 4 _ Y>2 

tan*(0,  12)  ♦  >/?«»  |  0i/2|  +  lJ 

Now  since  ai  and  fa  are  even,  periodic  functions  of  0i ,  they 
can  be  expanded  into  Fourier  cosine  series.  If  we  retain  only 
the  first  three  terms  of  each  series  (so  that  only  three  Fourier 
coefficients  need  actually  be  calculated  for  each),  and  apply 
a  triangular  window  (Fejer  kernel),  we  get 

a,  »  1.0587  -  (0.2518)  2  cos  0,  -  (0.00514)  2  cos  2  0, 
and 

fa  -  1.0587  ♦  (0.2518)  2  cos  0,  -  (0.00514)  2  cos  2  0, 
and,  so, 

fat  •  0.001294Z,-*  -  0.2653Zf  -  0.2653Z, 

+  0.001294Z* 
and 

b, 'j  •  0.0000264Zf*  ♦  Q.05252Z,-*  +0.2477 

+  0.05252Z?  +0.000264Z?. 

This  completes  the  design  of  the  two-dimensional  part  of  the 
filter  since  the  numerator  is  already  a  function  of  Zj  only. 

As  was  mentioned  previously,  the  one-dimensional  compen¬ 
sating  filter  in  the  examples  presented  in  the  next  section  was 
chosen  to  be  a  simple  FIR  filter,  which  was  designed  by  win¬ 
dowing.  Actually,  a  separate  compensating  filter  was  designed 
for  each  second-order  two-dimensional  section  (i.e.,  for  each 
k,  1  <  k  <  n  in  (2)] ,  the  order  of  the  compensating  filter  being 
equal  to  the  Zt -order  of  the  corresponding  section.  Unfortu¬ 
nately,  since  the  approximation  procedure  in  step  4)  exercises 
very  poor  control  over  the  passband  gain,  it  is  not  possible  to 

2A4 


If  we  then  let 


*'■>'£  I, 
r<(,,"££ 


i  -  4>)d<p 


i  "  4>)  d$ 


s  and  t  will  be  the  windowed  versions  of  s  and  t,  and  we  will 


l?(M  ”i|l  *(0W.  -4)d*\ 
<~j  |*(*)|/>(*i  -*)d* 


:±  r 

2wJ.r 

:±r 

2nJ_t 


0  -*)d* 


2P(0i 


Since  (5)  is  just  1  +  ?{8t),  we  get 

|*(0.)|<l  +  ?(«i)<2 

and  so  for  each  8X,  the  polynomial 
1  ♦  s(01)Z+/(0,)Z2 

has  no  zeros  on  the  set  {z||Z|  <  1).  It  follows  immediately 
that  the  two-variable  rational  function 


1  +  ♦  H0x)Zl 


is  stable. 


Q.E.D. 


Appendix  II 

When  the  above  windowing  procedure  was  used  directly,  the 
resulting  filters  exhibited  gross  underdamping  in  regions  where 
the  bandwidth  (in  the  02  direction)  was  close  either  to  0  or 
2m.  We  wil  now  discuss  the  reason  for  this,  and  justify  the 
procedure  given  in  step  4)  of  the  design. 

If  we  are  given  a  second  order  discrete  time  filter  with 
denominator  1  +  cZ  *  dZ *  (where  c  and  d  are  arbitrary  real 
numbers),  we  can  transform  it  into  a  continuous-time  filter 
by  using  the  bilinear  transform 


This  results  in  a  second-order  filter  whose  denominator  is 
(with  unity  leading  coefficient) 


-  „  1  - d  1 *c*d 

ft  +  2- - -r  +  - - - 

1  - c  +  d  1  -c  +  d 


In  general,  the  denominator  of  a  second-order  continuous¬ 
time  transfer  function  can  be  expressed  in  the  form 

**  ♦  2{u0  s  +  wo  (7) 

where  cjo  is  the  undamped  natural  frequency,  and  f  the  damp¬ 
ing  ratio  of  the  system  [17],  Comparison  of  (6)  and  (7)  yields, 
for  the  continuous-time  undamped  natural  frequency, 

/ 7+7+7 
— 7 

and  for  the  damping  ratio, 
r- 

V(l+c  +  d)(l-c  +  d)  ' 

Thus,  if  the  bandwidth  approaches  0  or  2ir,  we  must  have 
either  1  +  c  +  d  ■*  0 
or  1  -  c  +  </-*■  0 

and  if  the  damping  ratio  is  to  remain  finite,  we  must  have 
1  -  d  -+  0  (this  is  the  situation  in  the  fan  filter  designed  in 
Section  III). 

Now  suppose  the  errors  made  in  approximating  c  and  d  are 
of  the  same  order,  and  that  they  swamp  the  actual  values  of 
the  quantities  which  approach  0.  If  the  order  of  error  is  e,  then 

P  _£_  VL 

s/Te  2  ’ 

i.e.,  the  damping  ratio  will  be  very  small,  and  will  decrease 
with  increasingly  accurate  approximations  to  c  and  d.  Thus 
not  alone  do  we  get  poor  performance,  but  increasing  the 
order  makes  it  worse!  This  is  precisely  what  is  observed  in 
practice. 

Suppose,  however,  that  we  approximate 

a  =  y/TT7*d  and  0 *  >/l  c  +  d 

and  suppose  for  definiteness  that  1  -  c  +  </-*■  0.  Then  in  the 
above  situation  d  -*■  1  and  1  +  c  +  d  -*■  4. 

If  a  and  0  denote  the  approximations,  we  then  define 

d  =  !(a’  +£*>-  1 


*«*(«*-**) 

and  it  then  follows  that  the  damping  ratio  of  the  approximated 
filter  is 

C  ( 2-Sj)(2  +  S)*0 2 


If  we  make  the  same  assumptions  as  before,  we  get  (2  -  a )  ~  e 
and  0  ~  e  and,  so, 

?  ~  ~  ~  i 

5  2e(2  -  e) 

-a  much  more  acceptable  result. 

It  remains  only  to  show  that  this  procedure  gives  a  stable 
filter,  i.e.,  that  I  c  I  <  I  +  5  <  2  for  all  $x. 
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Fig.  6.  Circularly  lymmatrk  low -pan 


of  two  2  X  17 


Fig.  7.  Circularly  symmetric  low-para  filter: 

ractkMB. 


of  four  2  X  41 


of  two  2X17  fBteis.  Finally,  in  order  to  indicate  the  numeri¬ 
cal  stability  of  the  procedure,  the  tame  specification  was  used 
for  the  unpractically  large  filter  in  Fig.  7.  (Because  of  the  high 
Z i -order,  the  effect  of  the  one-dimensional  filter  was  inserted 
artificially  in  this  figure;  also  the  elliptical  shape  of  the  pass- 
band  is  an  artifact  of  the  plotting  system.) 

We  note  that  in  the  above  filters,  a  cascade  of  two  2  X  17 
sections,  for  example,  would  be  a  4  X  34  filter  if  it  were 
expressed  in  direct  form,  and  so  would  require  twice  as  many 
coefficients  to  be  stored  and  manipulated.  For  this  season, 
these  filters  should  be  implemented  in  cascade  form. 

V.  Conclusions 

We  have  described  an  inexpensive,  subopthnal  method  of 
designing  stable  half-plane  recursive  liters  as  a  cascade  of  fit¬ 
ters,  each  of  which  is  second-order  in  the  principal  direction 
of  recursion.  In  the  examples  presented,  analytic  expressions 
were  available  for  the  cascade  form  of  the  one-dimensional 
filters  used;  however,  even  when  this  is  not  the  case,  continua¬ 
tion  methods  [14],  [IS]  ate  ideally  suited  to  the  numerical 
calculation  of  the  appropriate  coefficients. 

The  major  open  problem  associated  with  the  method  is  un¬ 
doubtedly  the  development  of  mom  effective  apptmimatioa 
methods  for  the  denominator  coefficient  functions.  While  the 
method  used  above  gives  quite  satisfactory  results,  one  can  not 
help  feeling  that  one  should  be  able  to  do  as  erefl  with  half  the 
number  of  coefficients  (since  the  approximating  polynomial  is 
squared  after  its  order  has  .been  chosen).  Furthermore,  if  the 
approximation  procedure  controlled  the  paasband  gain  more 
tightly  than  the  present  method  does,  it  would  be  possible 
to  design  the  one-dimensional  compensating  filter  from  the 
ideal  response  rather  than  from  the  actual '  two-dimensional 
response.  Thus  an  improved  approximation  procedure  would 
yield  multiple  benefits. 

Such  approximation  procedures,  together  with  extensions  of 
the  above  design  method,  are  currently  under  investigation. 

Appendix  1 

Here,  we  prove  two  statements  made  in  the  paper.  The  first 
is  that  if  fc(0|,0i)  is  the  magnitude  response  of  any  stable 
transfer  function 


Ml  mm-M 


9i)d$ 2  -  In A. 


This  follows  from  the  fact  that,  by  the  stability  condition, 
for  each  fixed  the  one-variable  polynomial 

*A,Z) 

has  no  zeros  in  the  unit  disk  (zj|Z|  <  l).  It  therefore  has  an 
analytic  logarithm  there.  Therefore  the  function 

In  -  Reln*e*\Z) 

is  harmonic  on  the  unit  disk  and  so,  since  the  mean  value  of  a 
harmonic  function  on  a  circle  is  equal  to  its  value  at  the  center 
of  the  circle,  we  get 


£  f*  In  |*A  A|  dB  -  In  |* A  0)| 


•0.  Q-ED. 

The  second  statement  to  be  proved  is  that  windowing  by  s 
window  whose  kernel  is  positive  and  of  total  weight  one  pre¬ 
serves  stability. 

Proof:  Lai  the  kernel  be  P{8)  (an  nth  order  trigonometric 
polynomial)  and  assume  that  P(9)  >  0,  V 9,  and 


±r 
2-  1. 


P[B)  dB  -  1. 


Let  1  +s(Bx)Z  +  r(d|)Z*  be  any  second-order  polynomial  in 
Z  whose  coefficients  are  periodic  functions  of  Bx. 

Suppose  the  function  1  ♦  s(9x)Z  *  r(Bt)Z *  has  no  zeros  on 
the  set  {Z||Z|  <  1}  for  each  6- .  This  is  equivalent  to  [16] : 

|*(M  <!*'(*«) <2.  V#,. 
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This  is  dearly  equivalent  to 
|«*-0*|<S*+0*<4 

and  the  first  inequality  is  obvious  from  the  fact  that  both  a* 
and  0*  are  positive,  while  the  second  is  a  consequence  of  the 
following  argument.  We  have 


a* 


0  +  c(4>)  *  d(<t>)}  P(6  -  0)  d<t> 

A. 

with  a  similar  inequality  for  0  .  Adding  gives 
a*  +  01  <  ~~  f  2{1  +  d(<l>))P  ( 0  -4>)d4,<  4. 


Q.E.D 
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Aictrocr — An  explicit  rdatioMhip  between  the  fractional  representation 
approach  feedback  system  design  and  the  algebra-geometric  approach  to 
system  theory  is  fomudatod  and  used  to  derive  a  global  solution  to  the 
feedback  system  problem.  These  techniques  are  then  applied  to  the  stmuL 
~  taneous  stabilization  problem,  yielding  a  natural  geometric  criterion  for  a 
■  set  of  plants  to  be  simultaneously  stabilited  by  a  single  compensator. 


__  I.  Introduction 

CLASSICALLY,  in  control  theory  one  is  given  a  plant 
and  desires  to  design  a  control  system  around  this 
plant  which  meets  certain  design  specifications.  In  fact, 
however,  a  “real  world”  plant  is  never  known  exactly  and, 
as  such,  a  realistic  design  must  simultaneously  meet  specifi¬ 
cations  over  an  entire  range  of  plants  which  (hopefully) 
include  the  actual  plant  The  simplest  form  of  the  resultant 
simultaneous  design  problem  is  the  robust  design  problem 
wherein  one  desires  to  meet  the  design  specifications  in  an 
c  ball  around  a  prescribed  nominal  plant  Although  this  is 
satisfactory  for  dealing  with  modeling  errors,  it  cannot 
cope  with  plants  containing  unknown  parameters  and/or 
plants  characterized  by  multiple  modes  of  operation.  For 
instance,  the  dynamics  of  an  airplane  or  rocket  vary  widely 
with  altitude,  while  the  dynamics  of  an  electric  motor 
change  with  speed  and  load.  To  cope  with  these  problems, 
we  must  formulate  a  simultaneous  design  theory  in  which 
one  designs  a  control  system  to  simultaneously  meet  speci¬ 
fications  over  a  prescribed  set  of  plants.  Of  course,  the  set 
of  plants  may  be  taken  to  be  a  ball,  in  which  case  the 
classical  robustness  theory  is  replicated.  Alternatively,  one 
may  choose  to  work  with  a  set  of  plants  in  which  one  or 
more  parameters  vary  over  a  prescribed  range  and/or  a 
discrete  set  of  plants,  say,  the  dynamics  of  a  two-speed 
motor  in  its  high-  and  low-speed  settings. 

The  simultaneous  design  concept  is  possibly  best  il¬ 
lustrated  in  the  first-order  case  wherein  a  simple  geometric 
solution  suggests  itself.  Assume  that  our  plants  are  of  (he 
form 

'('>= 771  <"» 
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and  we  desire  to  design  a  stable  feedback  system  using  a 
proportional  compensator  with  gain  /.  This  results  in  a 
system  with  characteristic  function 

d(s)  =  s  +  (B  +  tA)  (12) 

and,  as  such,  the  feedback  system  will  be  stable  if  and  only 
if  B  +  tA  >  0.  Here,  for  a  given  compensator  t,  the  feed¬ 
back  system  will  be  stable  if  and  only  if  the  point  (A,  B ) 
lies  above  the  line  with  slope  1  /t  as  shown  in  Fig.  1(a).  As 
such,  if  we  want  to  simultaneously  stabilize  an  entire  set  of 
plants,  their  representations  on  the  A-B  plane  must  all  lie 
above  a  line  through  the  origin.  For  instance,  the  set  of 
plants  indicated  by  the  hatched  region  in  Fig.  1(b)  can  be 
simultaneously  stabilized  (by  a  compensator  with  gain 
—  £),  while  the  set  of  plants  shown  in  Fig.  1(c)  cannot  be 
simultaneously  stabilized  since  they  subtend  an  angle 
greater  than  180°  on  the  A-B  plane;  Similarly,  the  set  of 
plants  shown  in  Fig.  1(d)  cannot  be  simultaneously  stabi¬ 
lized  since  they  cross  the  negative  A  axis. 

The  above  example  suggest  two  alternative  criteria  for 
the  simultaneous  stabilization  problem.  One  may  adopt  an 
algebraic  criterion  to  the  effect  that 

B  +  tA>  0  (1.3) 

for  each  plant  in  the  prescribed  set  and  some  t.  While  such 
a  test  is  definitive,  it  is  local  in  nature,  allowing  one  to  test 
for  stabilizability  on  a  plant-by-plant  basis,  but  yielding  no 
global  criterion  with  which  to  characterize  a  set  of  plants 
which  is  simultaneously  stabilizable.  To  the  contrary,  one 
may  adopt  a  global  geometric  viewpoint  to  the  effect  that  a 
prescribed  set  of  plants  is  simultaneously  stabilizable  if  and 
only  if  it  is  contained  in  an  appropriate  half-plane.  The 
goal  of  the  present  paper  is  the  formulation  of  a  similar 
geometric  criterion  for  the  simultaneous  stabilization  prob¬ 
lem  applicable  to  general  linear  systems. 

The  starting  point  for  our  theory  is  the  ring- theoretic 
fractional  representation  theory  introduced  by  the  authors 
in  a  series  of  recent  papers  (9),  [16)  in  which  the  set  of 
compensators  for  a  given  plant  are  parameterized.  More¬ 
over,  as  a  first  cut  at  the  simultaneous  stabilization  prob¬ 
lem,  one  can  reverse  the  role  of  the  plant  and  compensator 
in  this  theory  to  parameterize  the  set  of  plants  which  are 
stabilized  by  a  given  compensator.  In  practice,  however, 
one  is  not  given  a  compensator  a  priori  and,  as  such,  we 
must  characterize  the  set  of  plants  obtained  by  the  latter 
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Fig.  I. 


parameterization  independently  of  the  choice  of  compensa¬ 
tor.  For  instance,  in  our  first-order  example,  the  set  of 
plants  in  Fig.  1(a)  are  stabilized  by  a  given  compensator 
-with  slope  1//,  while  a  given  set  of  plants  is  simultaneously 
stabilizable  if  and  only  if  it  lies  in  the  half-plane  above 
•some  line  through  the  origin.  For  the  genera]  problem,  this 
is  achieved  by  translating  the  fractional  representation 
theory  into  an  appropriate  geometric  setting  in  which  the 
“shape”  of  the  set  of  plants  obtained  from  the  latter 
parameterization  may  be  characterized.  In  turn,  the  simul¬ 
taneous  stabilization  problem  may  be  resolved  by  requiring 
that  the  given  set  of  plants  lie  in  a  region  of  the  ap¬ 
propriate  “shape.” 

Indeed,  the  appropriate  geometric  setting  proves  to  be 
just  the  Grassmannian  first  introduced  into  the  system 
theory  literature  by  Hermann  and  Martin  (11),  [12].  Unlike 
their  frequency  domain  formulation,  however,  we  obtain 
the  Grassmannian  directly  from  the  ring-theoretic  frac¬ 
tional  representation  previously,  employed  by  the  authors. 
Indeed,  the  Grassmannian  is  obtained  simply  by  factoring 
out  the  nonuniqueness  inherent  in  the  fractional  represen¬ 
tation  theory.  As  such,  in  addition  to  formulating  the 
global  theory  necessary  for  our  study  of  the  simultaneous 
stabilization  problem,  the  geometric  approach  yields  new 
insight  into  the  relationship  between  the  fractional  repre¬ 
sentation  theory  (which  we  identify  with  the  elements  of  a 
general  linear  group)  and  the  system  itself  (which  we 
identify  with  the  elements  of  a  Grassmannian). 

In  the  following  section,  the  fractional  representation 
theory  is  reviewed  and  the  required  Grassmannian  is  con¬ 
structed.  The  resultant  theory  is  then  used  to  formulate  a 
global  description  of  the  set  of  stabilizing  compensators  for 
a  given  plant  in  Section  III.  The  resultant  formulation  also 
yields  new  insight  into  the  problem  of  stabilizing  a  plant 
with  a  stable  compensator  [25]  for  which  a  necessary  and 
.sufficient  condition  is  also  derived  in  Section  III.  Finally, 
the  simultaneous  stabilization  problem  is  investigated  in 
Section  IV  wherein  both  global  geometric  and  local  alge¬ 
braic  criteria  for  the  simultaneous  stabilization  of  a  pre¬ 
scribed  family  of  plants  are  obtained. 

Although  the  present  paper  is  formulated  in  terms  of  the 
abstract  (pseudo)  coprime  fractional  representation  theory 
of  IH  (9],  [10],  [  1 6]-[  1 9],  it  should  be  pointed  out  that  this 


is  simply  one  manifestation  of  a  family  of  related  ap¬ 
proaches  to  the  contrcl  system  design  problem  developed 
during  the  past  half-dozen  years  by  Pernebo  and  Asirom 
[13],  [14];  AmsaSdis.  Pearson,  and  Cheng  [1],  [6],  [7];  Youla, 
Bongiomo,  Jabr,  and  Lu  [22]— (25);  and  Zames  [26]  among 
others  [2],  [20],  [21].  Indeed,  the  approaches  of  these  authors 
are  all  closely  related,  any  one  of  which  could  have  been 
used  as  the  basis  for  the  present  investigations.  In  particu¬ 
lar,  the  formulation  of  Zames  is  applicable  in  a  general 
ring-theoretic  setting,  and  is  essentially  equivalent  to  that 
employed  herein.  * 

II.  Fractional  Representation  and  the 
Grassmannian 

The  algebraic  fractional  representation  theory  is  set  in  a 
nest  of  rings,  groups,  and  multiplicative  structures: 

gDkDiDj. 

Here,  g  is  a  ring  with  identity  which  represents  the  general 
class  of  systems  with  which  we  wish  to  work:  rational 
matrices,  continuous  operators,  a  class  of  transcendental 
functions,  etc.;  and  A  is  a  subring  of  g  containing  the 
identity  which  models  the  systems  which  are  stable  in  some 
sense:  poles-  in  a  prescribed  region,  transcendental  func¬ 
tions  with  restricted  singularities,  causal  operators,  etc 
Finally,  f  denotes  the  multiplicative  set  composed  of  ele¬ 
ments  of  h  which  admit  an  inverse  in  g,  while/  denotes  the 
multiplicative  subgroup  of  i  made  up  of  elements  which  are 
invertible  in  h.  Detailed  examples  of  this  structure  were 
given  in  [4]  and  [8]  and  will  not  be  repeated  here 

We  say  that  a  system  s&g  has  a  right  fractional  represen¬ 
tation  in  \g,k,i,j)  if 

r  =  n„<C'  (2-1) 

where  n„Gh  and  d„Gi.  Furthermore  we  say  that  this 
representation  is  right  coprime  if  there  exist  u,,  and  o,,  in  A 
such  that 

1-  (2-2) 

This  equality  is  equivalent  to  the  classical  coprimeness 
concept  for  rational  functions  and  matrices,  while  being 
defined  in  our  general  ring-theoretic  setting.  In  particular, 
if  g  is  the  ring  of  rational  functions  and  A  is  the  ring  of 
polynomials,  (2.2)  implies  that  n„  and  dsr  have  no  common 
zeros;  and  if  g  is  the  ring  of  rational  functions  and  A  is  the 
ring  of  exponentially  stable  rational  functions,  (2.2)  implies 
that  n„  and  d„  have  no  common  right  half-plane  zeros. 

Since  g  is,  in  general,  noncommutative,  we  also  define  a 
left  fractional  representation  for  s  via  the  equality 

:  =  (2.3) 

for  nltEk  and  dltEi.  Furthermore  we  say  that  this  repre¬ 
sentation  is  left  coprime  if  there  exist  ult  and  vtl  in  A  such 
that 

(2-4) 
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Of  course,  in  the  classical  case  of  a  rational  function  or 
matrix,  these  fractional  represen  tations  are  assured  to  exist 
[10].  However,  this  is  not  the  case  in  the  general  ring-theo- 
retic  setting.  Therefore,  for  distributed,  time-varying,  and 
multidimensional  systems,  we  assume  that  our  plants  admit 
.such  a  representation  as  a  prerequisite  to  the  theory.1 
Interestingly,  however,  if  such  a  representation  exists,  we 
may,  without  loss  of  generality,  choose  utl  and  vtl  such  that 
the  equality 

=  (2-5) 

also  holds  [8].  In  this  case,  we  say  that  the  representation 
for  s  defined  by  the  eight-tuple  {*,,,  d„,  u,„v,„ntl, 
-  dlh  u,„  v,t }  is  doubly  coprime  and  we  express  the  defining 
equalities  of  (2.1)— <2-5)  via  the  matrix  equality  R~'=S, 
where 


where  o, is  an  appropriate  “partial  state”  vari¬ 
able.  As  such,  the  2  X  2  matrix  R ,  defines  a  natural  model 
for  the  given  system.  Indeed,  when  such  a  model  is  em¬ 
ployed,  one  can  drop  the  invertibility  requirement  on  d„ 
(and  dtl),  although  the  matrix  Rt  must  still  admit  an 
inverse  with  entries  in  A.1 

Since  the  2X2  matrices  R,  and  S,  have  entries  in  h  and 
admit  an  inverse  which  also  has  entries  in  A,  they  form  a 
group  which  we  denote  by  CLk( 2),  i.e.,  the  general  linear 
group  of  2X2  matrices  with  entries  in  A.  If  the  elements  of 
GLk(2)  are,  however,  to  serve  as  a  viable  system  represen¬ 
tation,  we  must  be  cognizant  of  the  fact  that  several  such 
matrices  may  represent  the  same  plant  The  appropriate 
equivalence  classes  may,  however, -be  characterized  with 
the  aid  of  the  subgroup  £  C  GL„( 2)  composed  of  the  upper 
triangular  matrices 


(2.7) 


It  is  interesting  to  compare  the  above  formulation  with 
that  proposed  by  Zames  [26].  Rather  than  working  with  an 
unstable  system,  s,  Zames  assumes  that  the  given  plant  is 
first  stabilized  via  classical  techniques,  and  then  develops 
his  design  theory  around  the  resultant  stable  plant  i.  Now, 
since  l  is  stable,  it  admits  the  trivial  right  coprime  represen¬ 
tation  i  =  J1  "  ‘‘with  the  equality 

Ms)  +  [  1  ][r]  =  u,tn„  +  ottd„  =  1  (2.8) 


implying  right  coprimeness,  while  a  similar  equality  defines 
a  left  coprime  representation  for  i.  As  such,  the  matrices 
Rf  and  Sr  take  on  a  very  special  form,  permitting  Zames  to 
implement  his  design  theory  without  explicitly  dealing  with 
u’s  and  u’s  nor  even  introducing  the  coprimeness  concept. 

The  key  to  our  geometric  formulation  of  the  fractional 
representation  theory  lies  with  the  observation  that  the 
2X2  matrices  R,  and  S,  constitute  a  natural  and  concise 
representation  for  the  given  plant  which  can  serve  in  lieu  of 
the  eight-tuple  of  rt’s,  d’s,  u’s ,  and  u's.  Indeed,  if  the 
input-output  relation  for  our  system  is  characterized  by 
the  equality 

=  =  (2.9) 

.  then  the  admissible  input-output  pairs  [9]  (<,,•’,)  for  our 
plant  are  parameterized  by  the  equality 


(2.10) 


in  GLk( 2).  Note  that  since  these  triangular  matrices  are 
assumed  to  be  in  GLk( 2),  it  follows  that  and  eu  are  in  /. 

Property  l:  Let  R,  and  R,  be  in  <?£.*( 2).  Then  Rt  and  R, 
represent  the  same  system  if  and  only  if  there  exists  £G  £ 
such  that  Rt  =  RtE. 

Proof:  If  R,  =  RtE  for  some  £€  £,  then 


where  8,  —  e^o,.  As  such,  the  set  of  input-output  pairs 
defined  by  R,  coincides  with  those  defined  by  R,  except 
for  a  change  of  parameterization. 

If  R ,  and  R,  define  the  same  set  of  input-output  pairs, 
then  for  any  such  pair  (e„  r,),  there  exist  o,  and  6,  such 
that 

l;]  =  *,o]=*.o]  (2-13) 

which,  in  turn,  implies  that 


Now,  £  =  Rj'RtE  GLk{ 2)  since  R,  and  Ri  are  in  <7£*(2), 
while  R  —  R,E  by  construction.  As  rich,  it  suffices  to  show 
that  £S  £.  This,  however,  follows  from  the  fact  that  (2.14) 
holds  for  all  o,  (and  a  corresponding  a,).  ■ 

Given  that  any  two  representations  in  GLk(2)  for  the 
same  system  differ  by  a  left  factor  in  £,  a  natural  setting 
for  our  system  theory  is  the  quotient  space  GLt(2)/E. 
Although  GLk{ 2)  is  a  group  and  £  is  a  subgroup,  £  is  not 
normal  and,  as  such,  GLh{2)/E  is  not  a  group.  For¬ 
tunately,  however,  the  resultant  coset  space  (of  equivalence 
classes)  is  a  well-known  and  much  studied  geometric  ob- 


'Compuitiionally.  ihe  evaluaiion  of  v.,  and  t,,  (or  and  l,,)  reduce! 

in  the  solution  of  a  linear  equation  in  ine  rins  h  In  particular,  if  A  is  a  3  In  the  case  where  rf* 1  does  not  exist,  the  plant  defines  a  relation  rather 
tint  of  stable  rational  functions  or  matrices  thereof,  one  can  mulupls  (2.2)  than  a  function  on  the  input-output  space.  The  resultant  relation  is. 
and  (I  4)  by  a  common  denominator,  and  thereby  reduce  their  solution  to  however,  parameterized  by  Moreoter.  since  R ,  is  invertible,  the 
that  of  a  classical  pols normal  equation.  relation  is  normal  in  the  same  sense  of  [I5|. 
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jecu  the  Grassmannian  [1]  <7,(1, 2),  which  we  will  adopt  as 
the  basic  setting  for  our  system  theory.1 

Since  £  is  not  a  normal  subgroup  of  (71,(2),  <7*(  1 . 2)  is 
not  a  group  and.  as  such,  does  not  admit  an  “internal" 
algebraic  structure.  The  group  (71,(2)  does,  however,  act  as 
a  set  of  transformations  on  <7,(  1,2).  Indeed,  if  TE  GLk( 2) 
tad  \U]E <7,(1, 2)  is  the  equivalence  class  of  C/S  (71,(2), 
we  define  ' 


showing  that  (since  E  is  invertible  in  (71,(2)). 

Conversely,  if  r,,6 /,  we  may  factor  T  via 


r(I/]*ln/]6<7*(l,2).  (2.15)  f/„  1  0  If 

Now,  if  [(/]  =  [K],  then  Property  1  implies  that  there  exists  lx«  l»J  Ml® 

£S  £  such  that  V»UE;  hence, . 


=  WE. 
(2.19) 


riK]*riw£]*[n/£]*lrt/j  (2.16) 

via  Property  1.  As  such,  the  operation  of  (71,(2)  on 
(7,11.2)  is  well  defined. 

-  As  a  prerequisite  to  the  formulation  of  our  stabilization 
-theory,  it  is  necessary  to  characterize  the  stable  systems  as 
interpreted  in  (71,(2)  and  <7,(1, 2).  Recall  that  if  s  u  stable 
(r€  A),  then  s  admits  the  doubly  coprime  fractional  repre¬ 
sentation 

*.-[]  ?)•  0”) 

Denoting  the  set  of  such  2X2  matrices  by  W  C  (71,(2),  it 
then  follows  from  Property  1  that  the  set  of  all  representa¬ 
tions  for  the  stable  systems  in  Gl,(2)  take  the  form 
S  =  WE  and,  as  such,  they  are  represented  by  (£]  »{WE\ 
=  [W]CG„(\,2).  Although  W  and  E  are  both  subgroups 
of  (71,(2),  they  are  not  commutative  and,  as  such,  5  *  WE 
is  not  a  group.  5  is,  however,  characterized  by  the  follow¬ 
ing  property. 

Property!:  Let 


be  in  <71,( 2).  Then  TES  if  and  only  if  („€/. 

Proof:  Since  £  *  WE  if  7"  6  S,  then  T  =  WE  for  some 
WE  W  and  £££;  hence, 

*tf  *  i*  Uk  field  of  scalars.  <7,(1. 2)  is  the  classical  Grassmsnmsj  (of 
lines  in  2- space),  whereas  if  *  is  taken  to  be  *  X  n  matrices.  <7,(1. 2) 
reduces  to  the  classical  Grassminruan  of  m  planes  in  2n  space  [1). 
Although  these  classical  Grassmannians  have  an  analytic  structure  (com¬ 
pact  manifold)  which  mav  not  he  shared  bv  our  abstract  Grassmannian. 
the  algebraic  properties  of  <7,(1. 2)  are  all  that  is  required  for  the  present 
theory,  and  hence  no  difficufty  is  encountered  by  working  with  elements 
taken  from  an  abstract  ring.  Indeed,  for  our  purposes,  we  oniy  use  the  fact 
that  <7,(l.2)  is  the  coset  space  <7£.»( 2)/E  and  identify  it  as  the  Grassman- 
juan  only  to  make  the  connection  with  the  classical  literature. 

It  is  interesting  to  note  that  the  Grassmannian  has  been  used  as  i 
natural  setting  (or  multivariate  systems  by  mathematical  system  theorists 
for  a  number  of  years  [2*1.  [29).  [32].  Here,  a  system  represented  by  an 
«  x  n  frequency  response  matrix  is  identified  with  a  curve  taking  values  in 
the  classical  Grassmannian  of  n  planes  in  2*  space.  As  such,  that  theory 
identifies  a  svstem  with  a  Grassmannian-valued  function,  while  our 
formulation  identifies  the  system  with  s  Grassmannian  built  from  a  ring 
of  functions  Of  course,  the  two  approaches  are  completely  equivalent  in 
the  muluvarate  case,  while  the  present  formulation  is  also  well  defined 
for  general  linear  systems  (time- varying,  distributed,  etc.). 


Since  tjtEj,  vn/j|/u'  and  ^  in  A. .Thus, 

the  unit  triangular  nature  of  W  implies  that  it  is  in  GLk(2). 
This,  in  turn,  however,  implies  that 

E  =  W-'TEGLk{2)  (2.20) 

since  (71,(2)  is  a  group.  Since  £  is  upper  triangular  and 
£  €  <71 ,( 2),  EE  E,  as  was  to  be  shown.  ■ 

Since  W  and  £  are  both  groups,  the  equality  5  =  WE 
implies  that  S~ 1  =  EW.  Now,  a  little  algebra  similar  to 
that  used  in  the  proof  of  Property  2  will  reveal  that 
TES~l  if  and  only  if  tuEj.  Indeed,  S  and  S~'  are 
precisely  the -two  classes  of  matrices  for  which  the  2X2 
matrix  inversion  formula  is  applicable  [16]. 

Before  concluding  this  section,  it  is  instructive  to  com¬ 
ment  on  the  relationship  between  (71,(2)  and  <?,(1.2).  In 
essence,  <71,( 2)  is  a  set  of  representations  for  our  systems, 
while  <7,(1, 2)  represents  the  set  of  systems.  That  is  to  say, 
(71,(2)  is  composed  of  the  computationally  tractable  ob¬ 
jects  with  which  we  actually  describe  a  system,  although 
many  such  objects  may  represent  the  same  system.  On  the 
contrary,  each  element  of  (7,(1, 2)  is  uniquely  identified 
with  a  system,  and  hence  we  may  think  of  <7,(1, 2)  as 
“being  the  set  of  systems"  (or,  at  least,  being  in  one-to-one 
correspondence  with  the  set  of  systems).  On  the  other 
hand,  the  elements  of  <7,(1, 2)  are  not  computationally 
tractable,  except  through  the  intermediary  of  (71,(2).  In 
practice,  therefore,  a  plant  is  characterized  by  one  of  its 
representations  in  (71,(2),  while  the  goal  of  the  system 
design  problem  is  to  specify  an  appropriate  compensator  in 
(7,(1, 2).  That  is,  we  are  designing  a  compensator  rather 
than  a  representation  of  a  compensator,  and  hence  even 
though  we  work  in  (71,(2)  as  a  matter  of  computational 
necessitv,  the  result  of  the  design  process  is  an  element  of 
<7,(1. 2). 

III.  Stabilization 

The  basic  feedback  system  we  consider  is  shown  in  Fig. 
2.  The  system  is  characterized  by  the  connection  equations 

=  +  (3.1) 

and 
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*f  =  Ml  —  vf%  ( 

while  the  plant  and  compensator  are  characterized  by 


:;]=*,  o'] 


r[s]  =  {[rs];se5}cc*(i.2).  (3.9) 

Finally,  let  Rf,Rp)  denote  the  set  of  stabilizing  compensa¬ 
tors  in  <j*(1,2)  for  a  given  plant  represented  by  RpE 
GLk(  2). 

Theorem :  RJiR,)  =  QR,Q'{S }. 

Proof:  To  prove  the  theorem,  we  will  show  that  the  set 
of  ail  representations  for  the  stabilizing  compensators  in 
GLk( 2)  takes  the  form  QRpQ‘S  for  some  SES,  from  which 
the  theorem  follows  upon  mapping  these  presentations  into 
corresponding  systems  which  are  identified  as  elements  of 
<7*(1,2).  If  Re  —  QRpQ'S  for  some  SES,  then 

[*,?,  +  Q'RCQP2]  =  [RpPt  +  QiQRpQ‘S)QPi 

=  Rp[Pl  +  Q'SQP1).  (3.10) 


-  respectively,  where  Rp  and  Re  are  in  GLt(2).  Letting  Q  be 
— .  the  2X2  matrix  in  GLk(2)  defined  by 


Now,  if 


>  =  f  0 

l-l 


‘21  s22 


(3.11) 


with  rn£  j,  since  SES,  a  little  algebra  will  reveal  that 


the  connection  equations  (3.1)  and  (3.2)  can  be  expressed 


'  =q  *  +  2 

"/  *v'\  Pi 


A  little  algebra  will  also  reveal  that  Q~ 1  =  Q‘  -  -  Q\  hence, 
this  connection  matrix  is  Teadily  manipulated.  Finally,  the 
substitution  of  (3.3)  and  (3.4)  into  (3.6)  yields  the  equality 


=  [*,?,  +Q’ReQP2]l']-  (3-7) 

Here  P,  =  diag(l,0]  and  P2  =  diag[0, 1). 

On  the  basis  of  the  above  formulation,  we  say  that  the 
system  is  stable  if  and  only  if 

[R'Pi  +  Q'R'QP^eGLtf).  (3.8) 

Since  [RpPx  +  Q’ReQP{\  has  entries  in  h,  (3.8)  implies  that 
its  inverse  exists  and  also  has  entries  in  A.  As  such,  a 
feedback  system  will  be  stable  if  and  only  if  the  relation¬ 
ship  between  its  input  vector  co!(ai2.Mi)  and  >ls  partial 
state  vector  col (op,oe)  is  stable.  With  the  aid  of  (3.3)  and 
(3.4),  this,  in  turn,  implies  that  the  relationship  between  the 
system  input  vector  and  all  of  its  internal  variables  is 
stable,  and  the  converse  is  also  true  [9). 

With  these  preliminaries,  we  now  proceed  with  out  first 
theorem  in  which  a  geometric  characterization  of  the  set  of 
stabilizing  compensators  for  a  given  plant  is  obtained.  To 
this  end,  recall  that  S  is  the  subset  of  <7I*(2)  correspond¬ 
ing  to  the  stable  systems  and  [S']  is  its  image  in  <7*(1,2), 
while  any  TECLk(2)  defines  a  transformation  on  (5J  C 
<?»(  1 , 2)  via 


Q‘SQ  =  [~Z  ^]€S"  <312> 

since  (Q'SQ)^  =  suE  j.  As  such, 

R,[Px  +  Q'SQP2]  =  Rp  q  €C£*(2),(3.13) 

showing  that  the  feedback  system  with  compensator 
QRpQ'S  is  stable. 

Conversely,  if  Re  is  a  stabilizing  compensator  for  the 
feedback  system, 

[*,F,  +Q'RcQP2]EGLh{2).  (3.14) 

Moreover,  since  Q  and  Rp  are  in  GLk( 2),  we  may,  without 
loss  of  generality,  assume  that  Re  is  of  the  form 


RC  =  QRPQ'X, 
in  which  case  it  suffices  to  show  that 
X  =  QR;'Q'RCES. 
To  this  end,  we  observe  that 


(3.15) 


(3.16) 


[*,/>,  +  Q'RcQP2]  =[RpP ,  +  Q'{QRpQ'X)QP:] 
=  Rp[Pj  +  Q'XQP2] 


X'XQ  =  _ 


]  ^21 


(3.17) 


(3-18) 
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Now,  since  [ RpPt  +  Q'R'QPJE GL„{ 2), 


?roo/^-  If  RpES~  '5,  there  exist  S,  and  S:  in  S  such 


1  -X-. 


Now  let 


(3-19) 


/?,  =  Sf'S,. 
RC  =  QS{'Q' 


(3-23) 

(3214) 


*  which  implies  that  x,,€  j  as  required  to  verify  that  XES. 
As  such,  an  arbitrary  stabilizing  compensator  for  our  sys¬ 
tem  is  of  the  form  Re  *  QRfQ‘X. 

The  set  of  all  representations  of  all  stabilizing  compensa¬ 
tors  in  GLk( 2)  thus  takes  the  form  QRrQ‘S  C  GL„(2). 
Now,  upon  mapping  this  set  into  the  Grassmannian,  we 
obtain 

■  [C^C,5]=C^C,l5)t  (3  JO) 

completing  the  proof.  ■ 

**  Unlike  the  previous  results  given  directly  in  terms  of  the 
fractional  representation  theory  [9],  (24]  which  arc.  local  in 
nature,  the  present  theorem  yields  a  global  description  of 
the  set  of  stabilizing  compensators  for  a  given  plant.  In¬ 
deed,  the  required  set  is  just  a  copy  of  the  stable  system  in 
the  Grassmannian  transformed  by  the  action  of  QRfQ 
It  is  interesting  to  compare  the  parameterization  of  the 
stabilizing  compensators  of  the  theorem  with  that  obtained 
directly  from  the  fractional  representation  theory.  Indeed, 
a  little  algebra  with  the  results  of  [4]  win  yield  the  equality 

Rt=QR,Q'ir;  wew  (3.21) 

for  the  family  of  stabilizing  compensators.  Recalling,  how- 
•  .ever,  that  5  =  itT,  this  parameterization  differs  from  that 
of  the  theorem  only  in  that  the  equivalence  transformation 
£  has  been  deleted.  As  such,  the  set  of  stabilizing  com¬ 
pensators  of  (3.21)  includes  exactly  one  representation  for 
each  stabilizing  compensator  rather  than  parameterizing  all 
representations  for  the  stabilizing  compensators  as  is  the 
case  with  the  present  theorem.  Of  course, 

[QR,Q‘W]=QRQ'[W]  =  QRQ'[S],  (3.22) 

showing  that  the  same  set  of  compensators  in  the  Grass¬ 
mannian  is  defined  by  the  two  theories. 

It  follows  immediately  from  the  theorem  that  a  stabiliz¬ 
ing  compensator  always  exists  given  that  the  plant  is 
modeled  by  an  RpEGL„( 2).  In  practice,  however,  one 
often  requires  that  the  compensator  be  of  a  special  form: 
stable,  memoryless,  diagonal  etc.  As  such,  if  [C]  represents 
the  desired  class  of  compensators  in  <7*(1, 2X.it  is  necessary 
_  to  design  a  compensator  in  (C]ng/^  £>'{$],  in  which  case 
it  is  not  clear  that  such  a  compensator  even  exists.  In  the 
case  where  [C]  represents  the  stable  systems,  i.e.,  we  desire 
to  find  a  stable  stabilizing  compensator  [2S],  a  simple 
criterion  for  the  existence  of  the  required  compensator  can, 
however,  be  obtained  as  follows. 

Corollary  /:  A  plant  represented  by  Rp€GLk( 2)  can  be 
stabilized  by  a  stable  compensator  if  and  only  if 


which  is  stable  since  the  congruence  transformation  de¬ 
fined  by  Q  maps  1  to  S  and  conversely.  Moreover,  since 

Q'  =  Q-\ 

*'  =  QS r  lQ'  =  esf  'Q'QSxQ'QST  XQ' 

=  (QSf  lStQ')(QSr  lQ‘)  =  QR,Q'S  (3 25) 

where  S  =  QS'  'Q'E  S.  As  such,  we  have  constructed  a 
stable'  stabilizing  compensator  for  Rp  as  required. 

Conversely,  if  RteS  is  a  stable  stabilizing  compensator 
for  Rp,  then  by  the  theorem,  Rc  =  QRpQ'S  for  some  SE  S. 
As  such, 

R,  =  Q'R{S-'Q  =  Q'RCQQ'S-'Q 

=  (fi^rC)(C,S',e)=Sf,Sl  (3.26) 

since  the  congruence  transformation  defined  by  Q  maps 

Rtes to s?'es-]  »nds~les~'  toS,es.  ■ 

Of  course,  when  the  hypotheses  of  the  corollary  hold,  the 
set  of  stabU  stabilizing  compensators  for  Rp  =  Sf ‘S,  is 
nonvoid  and  given  by  No  explicit  para¬ 

meterization  for  this  set  is,  however,  known  nor  is  it 
obvious  that  one  even  exists.  This  intuition  will  be  explored 
further  in  the  following  section  on  the  simultaneous  stabili¬ 
zation  problem. 

IV.  Simultaneous  Stabilization 

The  key  to  our  solution  of  the  simultaneous  stabilization 
problem  lies  with  a  reversal  of  the  analysis  used  in  the 
derivation  of  the  stabilization  theorem.  That  is,  one  as¬ 
sumes  that  Re e  GL„( 2)  is  given  and  parameterizes  the  set 
of  plants  which  are  stabilized  by  Re.  Denoting  that  set  of 
plants  by  Rp(Rt),  we  obtain  the  following  theorem.  Since 
the  proof  for  the  theorem  is  virtually  identical  to  that 
already  given  for  the  stabilization  theorem  of  the  previous 
section,  it  will  not  be  repeated  here. 

Theorem:  R,(Re)  =  QRcQ‘[S). 

Since  every  TEGLk(2)  is  of  the  form  T  —  QReQ'  (with 
Rt~Q'TQ),  the  lemma  implies  that  a  set  of  plants  PC 
(7*(1,2)  can  be  simultaneously  stabilized  if  and  only  if  they 
lie  in  a  copy  of  the  stable  systems  [5]  transformed  by  the 
action  of  some  TE  GLh(2),  Indeed,  this  is  the  desired 
generalization  of  the  first-order  example  given  in  the  intro¬ 
duction  to  the  case  of  a  general  linear  system.  In  the 
general  case,  the  Grassmannian  plays  the  role  of  the  A-B 
plane,  the  stables,  (5],  play  the  role  of  the  half-plane,  and 
the  general  linear  group,  C7L*(2),  serves  as  the  group  of 
"rotations.”  These  observations  are  summarized  in  the 
following  corollary. 

Corollary  2:  A  set  of  plants  P  C  G*(l,2)  can  be.  simulta¬ 


neously  stabilized  if  and  onlv  if  they  lie  in  r[Sj  for  some 

R,es~'s.  reGL,(\.2). 
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Although  the  corollary  represents  a  complete  geometric 
solution  to  the  simultaneous  stabilization  problem,  it  is  not 
amenable  to  convenient  implementation.  We  therefore  give 
an  alternative  algebraic  solution  to  the  problem  in  <7Z.*(2). 

Corollary  3:  Let  RpE  JLk(2),  pEP  be  representations 
for  a  set  of  plants  P  CGk(l,2).  Then  P  can  be  simulta¬ 
neously  stabilized  if  and  only  if  there  exists  a  family  of 
matrices  Sp  E  S,  p  E  P  such  that 


As  such,  Rc  stabilizes  the  zero  plant  if  and  only  if  dcrE  j 
or,  equivalently,  R(ES.  The  problem  of  stabilizing  p  with 
a  stable  compensator  is  thus  equivalent  to  simultaneously 
stabilizing  p  and  the  zero  plant.  Since  the  zero  plant  is 
represented  by  le  GLh(2 ),  it  follows  from  (4.6)  that  p  can 
be  stabilized  by  a  stable  compensator  if  and  only  if 

Rp  =  {))~'  Rp  =  R~[RpES~'S  (4.9) 


s-'s, p^gp- 

Proof:  If  Re  simultaneously  stabilizes  Rp,  pEP,  then 
it  follows  from  Corollary  2  that  there  exist  SpES,  pEP 
such  that 

Rp  =  QR'Q'Sp;  pEP.  (4.1) 

-  Hence, 

/?;'y?f=(^fC'5,)"(c/?fe'sJ  =  S7ls;.  (4.2) 

■  Conversely  if  there  exists  SpES,  pE  P  such  that  R~xRq  = 
S~'Sq\  p,qE  P,  then  we  let  p'  be  an  arbitrary  plant  in  P 
and  define  Rc  via 

Re  =  Q'RpS;.'Q.  (4.3) 


Re  is  independent  of  the  choice  of  p’.  Indeed,  if  p"  is  an 
‘alternative  choice,  then  since  R~' R p~  ~  Sp  'Sp~, 

Rrs;.'  =  Rf.(R-'Rf..SpZ')  =  RP„SPV.  (4.4) 

.  Moreover,  for  any  pE  P,  RJ-'Rp  —  Sff'Sp:  hence, 

Rp  =  Rp  S~.  'Sp  =  QQ'Rp  Sp~  lQQ'Sp 

=  q{Q'Rps;'q.)q'sp=qrcq'sp  (4'.5) 

from  which  it  follows,  via  the  corollary,  that  the  set  of 
plants  P  C  Gk(  1 , 2)  is  simultaneously  stabilized  by  R  c.  ■ 
It  is  interesting  to.  note  that  in  the  special  case  in  which 
P  contains  exactly  two  plants,  say  p  and  q,  then  they  are 
simultaneously  stabilizable  if  and  only  if 

R~p'RqES~'S  (4.6) 


which  is  identical  to  the  criterion  of  Corollary  1  to  stabilize 
a  plant  with  a  stable  compensator.  Indeed,  this  should  not 
be  surprising  since  the  problem  of  stabilizing  a  plant  with  a 
stable  compensator  is  completely  equivalent  to  the  problem 
of  simultaneously  stabilizing  the  given  plant  and  the  zero 
plant  (represented  by  R,  —  1 E  <7L*(2)].  This  follows  im¬ 
mediately  from  the  definition  of  stability  used  in  (4)  or, 
alternatively,  one  may  let  Rt  =  1  and 


in  which  case 


[R,P,  +  Q'R{QP1]  = 


-  n. 


(4.8) 


which  replicates  the  result  of  Corollary  3. 

Of  course,  a  similar  argument  can  be  used  to  obtain  an 
algebraic  criterion  for  the  simultaneous  stabilization  of  a 
set  of  plants  P  by  a  stable  compensator.  To  this  end,  we 
simply  augment  the  set  of  plants  P  by  the  zero  plant  and 
then ‘apply  the  theorem  to  P'  —  (0)  U/*.  Alternatively,  a 
geometric  criterion  for  the  simultaneous  stabilization  of  P 
may  be  obtained  by  requiring  that  PCTIS]  for  some 
TES~X.  Since  the  corresponding  Re  is  given  by 

KC  =  Q'TQ,  (4.10) 

the  resultant  compensator  will  be  stable. 

Finally,  the  possibility  of  parameterizing  the  set  of  com¬ 
pensators  (or  stable  compensators)  which  simultaneously 
stabilize  P  should  be  considered.  In  effect,  this  amounts  to 
parameterizing  the  set  of  2X2  matrices  TEGLk(2)  or 
TES~]  such  that  P  C  T[5]  which,  in  turn,  requires  some 
kind  of  parameterization  for  P.  In  particular,  if  />  =  T[5), 
then  Rt  =  Q'TQ  is  the  unique  stabilizing  compensator, 
while  the  stabilizing  compensators  for  a  single  plant  p  may 
be  parameterized  by  [S]  as  per  the  stabilization  theorem  of 
Section  III. 


V.  Examples 

Since  the  action  of  GLh{ 2)  on  (7*(1,2)  is  a  geometric 
invariant,  one  can,  at  least  intuitively,  say  that  the  "shape” 
of  T[5]  is  identical  to  that  of  (5],  and  every  set  whose 
“shape"  is  the  same  as  (S)  may  be  obtained  from  (5]  by 
such  a  transformation.  As  such.  Corollary  4  implies  that  a 
prescribed  set  of  plants  P  C  <7*(1,2)  admits  a  simultaneous 
stabilization  if  and  only  if  P  is  contained  in  a  subset  of  the 
Grassmannian  whose  “shape”  is  the  same  as  [S],  For 
instance,  in  the  example  of  Fig.  1,  P  must  be  contained  in 
an  appropriate  half-plane. 

Although  well-defined  coordinate  systems  do  exist  in  the 
Grassmannian,  at  the  time  of  this  writing  we  have  yet  to 
formulate  a  computational  algorithm  for  implementing  the 
above-described  simultaneous  stabilization  problem  in  any 
degree  of  generality.  Interestingly,  however,  in  the  case 
where  P  is  composed  of  exactly  two  plants  (a  two-speed 
motor  or  the  dynamics  of  a  swing-wing  aircraft),  a  simple 
frequency  domain  criterion  for  the  simultaneous  stabili¬ 
zation  problem  was  given  by  Chua  [8],  (16)  and  the  authors 
in  the  single-variate  case,  and  has  since  been  extended  to 
the  multivariate  case  by  Vidyasagar  and  Viswanadham 
(19). 

To  illustrate  the  above-described  general  geometric  crite¬ 
rion  in  a  computationally  trackable  setting,  consider  the 


case  of  a  set  of  third-order  single-variate  plants 


f  +  PzS  ^  P\S+Po 


(5-1) 


.  which  are  to  be  stabilized  by  a  proportional  compensator 
c(s)  =  r.  To  represent  the  class  of  plants  geometrically,  we 
identify  the  six-dimensional  space  of  such  plants  with  a 
family  of  Jnree-dimensional  Euclidian  spaces  (with  coordi¬ 
nates  pQ,  P).  P2)  parameterized  by  the  numerator  coeffi¬ 
cients  q0,  qx,  and  q2.  As  such,  we  have  a  three-dimensional 
family  of  three-dimensional  spaces  £(„.,„«,)•  Of  course,  if 
there  is  no  feedback  (r  =  0),  such  a  system  will  be  stable  if 
j,  and  only  if 

Po.Pi>0  (52) 

-  and 

P\>Po/Pi  -  (5-3) 

.independently  of  the  numerator  parameters.  The  resultant 
stability  region  for  the  /  =  0  case  is  thus  as  illustrated  in 
Fig.  3(a). 

In  the  case  of  a  nonzero  compensator,  a  similar  argu¬ 
ment  with  the  Hurwitz  criterion  will  yield  the  set  of 
inequalities 

Fo +  '*<>>  0  (5-4) 

p,  +  rq,>0  (5.5) 

and 

P\>(Po  +  '<la)/(Pi  +  t<h)-  (5.6) 

As  such,  for  a  fixed  value  of  the  numerator  coefficients  q0, 

q ,,  and  q2,  the  new  stability  region  is  identical  in  ‘‘shape” 
to  the  t  *  0  case  except  for  a  shift  of  the  origin  to  the  point 
(rq0,rq„rq:)  in  as  illustrated  in  Fig.  3(b).  Of 

course,  as  r  changes,  the  origin  of  the  stability  region  moves 
along  the  line  determined  by  the  point  (q<>’?i>?j).  Unlike 
the  /  =  0  case,  however,  where  the  stability  region  is  inde¬ 
pendent  of  the  numerator  parameters,  the  line  along  which 
the  origin  of  the  stability  region  moves  is  determined  by 
the  numerators  parameters.  Thus,  in  this  example,  our 
three-dimensional  family  of  three-dimensional  Euclidian 
spaces  plays  the  role  of  the  Grassmannian,  with 

the  region  defined  by  (5.2)  and  (5.3)  in  each  such  space 
characterizing  the  stable  systems.  Furthermore,  the  real 
group  (corresponding  to  the  proportional  compensators) 
acts  on  this  space  by  translating  the  stability  region  in  the 

-  space  along  a  line  determined  by  the  point 

■  Although  the  above  example  was  derived  from  basic 
principles,  we  believe  that  it  illustrates  the  essential  geo¬ 
metric  nature  of  the  simultaneous  stabilization  problem  as 
formulated  in  our  abstract  theory.  Indeed,  a  prescribed  set 
of  plants  can  be  simultaneously  stabilized  if  and  only  if 
they  are  contained  in  a  translate  of  the  stables. 


(*) 


Rg  3. 


VI.  Conclusions 

Our  purpose  in  the  preceding  has  been  threefold.  First, 
we  have  attempted  to  exhibit  the  essential  relationship 
between  the  fractional  representation  theory  and  the  alge- 
bro-geometric  approach  to  system  theory.  Second,  we  have 
presented  a  global  solution  to  the  feedback  system  sta¬ 
bilization  problem.  Third,  a  solution  to  the  simultaneous 
stabilization  problem  has  been  presented.  It  should,  how¬ 
ever,  be  pointed  out  that  the  solution  presented  for  the 
simultaneous  stabilization  problem  is  mathematical  in  na¬ 
ture  and  not  intended  for  computational  implementation. 
At  the  present  time,  no  computationally  feasible  solution 
to  the  simultaneous  stabilization  problem  is  known,  except 
in  the  case  where  P  contains  exactly  two  plants  wherein  a 
simple  frequency  domain  test  is  possible  [8],  [16]  and  in  the 
simple  low  dimensional  illustrated  herein. 
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A8STRACT 

The  problem  of  designing  a  feedback  controller 
which  stabilizes  a  number  of  plants  simultaneously 
is  discussed  from  the  fractional  representation 
point  of  view.  An  abstract  solution  of  this  gen¬ 
eral  simultaneous  stabilization  problem  is  present¬ 
ed,  and  an  elementary,  explicit  criterion  is  given 
for  the  simultaneous  stabilizability  of  two  systeas. 
finally,  some  examples  and  counter  examples  are 
presented,  and  some  open  problems  aje  discussed. 


unstable)  stabilizing  compensator  for  a  given  plant 
is  reviewed. 

2.  SIMULTANEOUS  STABILIZATION 
AND  STABLE  STABILIZATION  * 

We  consider  the  feedback  system  shown  in  Fig. 
1;  this  is  characterized  by  the  connection  equa¬ 
tions 


1.  INTRODUCTION 


Classically,  in  control  theory  one  is  given  a 
plant  and  desires  to  design  a  control  system  a- 
round  this  plant  which  meets  certain  design  spec¬ 
ifications.  In  fact,  however,  a  "real  world* 
plant  is  never  known  exactly  and,  as  such,  a  rea¬ 
listic  design  must  simultaneously  meet  specifica¬ 
tions  over  an  entire  range  of  plants  which  (hope¬ 
fully)  include  the  actual  plant.  The  simplest 
form  of  the  resultant  aumilXantoua  dtkian  p/iobio* 
is  the  aobuac  design  problem  wherein  one  desires 
to  meet  the  design  specifications  in  an  c-ball 
around  a  prescribed  nominal  plant.  Although  this 
is  satisfactory  for  dealing  with  modeling  errors  It 
cannot  cope  with  plants  containing  unknown  para¬ 
meters  tnC/or  plants  characterized  by  multiple 
modes  of  operation.  For  Instance,  the  dynamics  of 
an  airplane  or  rocket  very  widely  with  altitude 
while  the  dynamics  of  an  electric  motor  change  with 
speed  and  load.  To  cope  with  these  problems  we 
must  formulate  a  ^uniLzcjitoiu  design  Zhtotuj  in 
which  one  designs  a  control  system  to  simultaneous¬ 
ly  meet  specifications  over  a  prescribed  set  of 
plants.  Of  course,  the  set  of  plants  may  be  taken 
to  be  a  ball  in  which  case  the  classical  robustness 
theory  is  replicated.  Alternatively,  one  may 
choose  to  work  with  a  set  of  plants  in  which  one  or 
more  parameters  vary  over  a  prescribed  range  and/ 
or  a  discrete  set  of  plants;  say,  the  dynamics  of 
a  two  speed  motor  in  its  high  and  low  speed 
settings. 


where  r0  1  - 

Q*  ,  (2.1) 

-1  0  . 

—  — 


To  describe  the  dynamics  of  the  plant  and 
controller,  we  use  the  abstrect  fractional  repre¬ 
sentation  theory  of  [ID  ,  [2],  [3J.  This  essumes 
four  sets 

9  3  h  s  i  z»  j 

where  g  is  a  ring  with  identity  which  represents 
the  general  class  of  systems  with  which  we  wish  to 
work,  h  is  a  subring  of  g  corresponding  to  the 
stable  systems  in  g,  i  is  a  multiplicative  set 
consisting  of  the  elements  In  h  which  have  inverses 
in  g,  and  j  is  the  subgroup  of  h  which  consists  of 
the  elements  of  h  which  have  inverses  in  h. 


The  purpose  of  this  paper  is  to  review  the  We  assume  that  the  plant  P  has  a  right-coprime 

state  cf  research  on  the  simultaneous  design  prob-  fractional  representation  • 

lem  Including  the  derivation  ef  an  explicit  cri-  . 

terion  of  the  simultaneous  stabilization  of  two  P  ■  H  D* 

distinct  plants  and  an  algebro-geometric  solution  r  r  ■ 

of  the  general  simultaneous  stabilization  problem.  where  the  coprimeness  is  exhibited  by 
In  addition,  the  fundamental  relationship  between 

the  simultaneous  stabilization  problem  and  the  U  N  *  V  O  *  1, 

problem  of  designing  a  stable  (or  minimally 
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*rai  *lse  a  left-capriae  representation 


?  -  0 


k  u  *  o  V  •  1 

It  It  • 

It  has  been  shown  r[23  that  one  can  then  find  U, 

and  V  so  that,  in  addition, 

>  •  •  . 

v*||  •  UV 
rut  Pt  • 

Thus  the  plant  P  can  be  described  by  the  matrix 


Theorem  1:  [<3 

The  set  of  plants  stabil liable  by  a  compen¬ 
sator  C  is  precisely  the  set  of  stable  plants 
transformed  by  left  multiplication  by  Qftc9* 

This  can  also  be  restated  as  follows:  A  set 
P  of  plants  is  simultaneously  stabilittble  iff 
P  lies  in  an  image  of  the  stables  under  left  mul¬ 
tiplication  by  some  element  of  GLh(2). 

These  criteria,  while  geometrically  appealing, 
can  not  be  handled  analytically;  the  following 
equivalent  criterion  is  therefor*  useful. 


Theorem  2: 


“  RP  * 


or  fts  Inverse 


.  he  admissible  input-output  pairs(c  ,  v  )  for  the 
.plant  are  then  described  by  p  p 


-/here  ep  is  a  kind  of  •partial  state*  for  the 
lant  P. 

The  corresponding  matrices  for  the  controller 
11  be  denoted  by  Re  end  $c,  etc.. 

It  can  then  be  shown  [33.(4]  that  a  given  con- 
roller  C  will  stabilize  a  plant  P  if  and  only  if 

a  •  QR  0^1  (2.2) 

P  c 

->ert  0  is  tht  connection  matrix  (2.1)*  and  W  is  of 


®]  pll  e12 

l  _°  e22 


•re  each  matrix  is  in  Glj(h)  -  i.e.,  is  a  2x? 

.rim  of  elements  cf  h  which  has  an  inverse  whose 
aments  ere  in  h.  It  is  easy  to  see  that  being  of 
form  (2.2)  is  ecuivalent  to  being  an  element 
Glj(h')  whose  (l,l)-element  is  in  j,  and  that 
«  in  turn  is  ecuivalent  to  being  the  S-metrix 
a  stable  system.  Thus  In  terms  of  the  R-matrix 
resentatlon,  we  can  restate  (2.2)  as 


Let  P'  'be  a  set  of  plants  represented  by 
ftp.  p  c  P.  The  the  set  P  is  simultaneously  rtab- 

11 liable  iff  there  exists  a  family  of  matrices 
Wp,  p  e  P,  of  the  form  (2.3)  such  that 

•  ft"1*,  ,  Y  P.  Q  c  P. 

Proof: 

The  necessity  of  this  condition  is  obvious 
from  (2.2).  The  sufficiency  can  be  checked  by  de¬ 
fining 

«;  •  ofy;1  o 

for  any  p  t  P,  and  using  the  condition  to  check 
that  Rc  is  well  defined.  Then  (2.2)  follows. 

One  can  Insert  the  requirement  that  the  com¬ 
pensator  be  stable  merely  by  adjoining  the  zero 
plant  to  the  given  set  of  plants  -  a  compensator 
is  stable  If  and  only  if  it  stabilizes  the  zero 
plant.  Thus  the  stable  stabilization  of  n  plants 
simultaneously  can  be  treated  as  a  problem  of  sim¬ 
ultaneously  stabilizing  n*l  plants.  The  converse, 
although  less  obvious,  is  also  true  -  see  [33. 

3.  THE  TWO-PLANT  CASE 

The  only  case  of  which  we  know. in  which  even 
the  analytic  criterion  in  section  2  can  be  imple¬ 
mented  is  the  case  of  two  plants.  In  this  case, 
the  criterion  is  as  follows:  two  plants,  with  rep¬ 
resenting  matrices  and  R^,  can  be  simultaneously 

stabilized  iff  there  exist  matrices  end  of 

the  form  (2.3)  such  that 

■  *\\  (3.1) 

We  note  that,  since  the  identity  K-metrix 
corresponds  to  tne  zero  plant,  this  is  equivalent 

to  the  condition  that  the  "plant"  have  a 

stable  stabilizing  compensator.  (This  is  an  ex¬ 
ample  of  the  converse  mentioned  at  the  end  of 
section  2.)  For  the  linear  time-invariant  case, 
the  problem  has  been  solved  by  Youla  [53.  It  is 
of  interest,  however,  to  relate  Ycule’s  solution 


to  our  approach.  To  this  end,  we  denote  R^R2 


end  restrict  ourselves  to  the  sceler-input  scalar- 
output  case.  If  we  write  (3.1)  in  the  form 


EXAMPLES:  f<] 

Suppose  given  a  family  of  plants  of  the  f0f% 

p(s)  *  ik 

we  would  like  to  know  when  it  is  possible  to  stab¬ 
ilize  these  simultaneously  by  use  of  a  proportion*! 
compensator  with  gain  t.  In  this  case  the  denomi¬ 
nator  of  the  closed-loop  transfer  function  is 


and  use  the  fact  that  the  (l,l)-entry  in  each. of 
the  h'-matrices  must  be  In  j,  we  get  the  condition: 

0  ♦  TH  c  i  for  some  t  t  h. 

In  the  present  case,  this  is  clearly  equivalent  to 
requiring  the  existence  of  a  stable,  minimum-phase 
transfer  function  which,  at  each  closed  right  half¬ 
plane  zero  of  N,  interpolates  0  to  an  order  equal 
at  least  to  the  order  of  the  2ero  of  N.  Continuity 
and  realness  of  the  transfer  functions  show  that  a 
necessary  condition  for  this  is  that  D  have  -the 
same  sign  at  all  closed  positive  real-axis  zeros  of 
K.  You la  showed  that  this  was  also  sufficient. 

Thus  to  solve  the  problem  of  simultaneously 
stabilizing  two  plants,  it  remains  only  to  express 

K  and  D  in  terms  of  the  original  plants.  An  easy 
calculation  shows  that  we  can  take 


d(s)  ■  .$  *  (BnA) 

and  so  the  feedback  system  will  t  stable  if f 
8+Ta  >  0.  Thus  a  set  of  plants  is  simultaneously  • 
stabili zable  iff  for  some  t,  B+tA  »  o  for  each  r 
plant  in  the  set.  Now  each  plant  can  be  represent¬ 
ed  as- a  point  in  the  (A.B)-plane;  in  this  represen¬ 
tation,  a  set  of  plants  is  simultaneously  stabi- 
lizable  iff  the  set  lies  entirely  above  some 
straight  line  through  the  origin.  The  slope  of 
such  a  line  is  -t,  where  t  is  the  gain  of  a  stab¬ 
ilizing  compensator.  For  example,  the  set  in 
Figure  2.b  is  stabilizable,  while  the  sets  in 
Figures  2.c  and  2.d  are  not.  Since  the  set  of 
stables  in  this  case  is  the  upper  half-plane,  this 
gives  a  very  vivid  (although  very  special)  fllus-  • 
‘ration  of  theorem  1:  a  set  of  plants  is  siwjltjtn. 
eously  stabilizable  iff  it  is  contained  in  a  ro¬ 
tated  (or  more  accurately,  sheared)  version  of  the 
set  of  stables. 


N  »  N201  - 

and  D  •  V.^  ♦  . 

where 


Thus  in  order  for  the  plants  to  be  simultan¬ 
eously  stabilizable,  it  is  necessary  and  suffic¬ 
ient  that  V-jCj  «■  U-jNj  h*we  lhe  same  *t 

closed  positive  reel-axis  2eres  of  ^0^  -  N ^ 0^ - 

Some  calculations  involving  the  coprimeness  con¬ 
ditions  show  that  this  is  equivalent  to  the  con¬ 
dition  that  0-/0,  have  the  same  sign  at  all 
dosed  positive  real-axis  zeros  of 

This  gives  a  criterion  in  terms  of  the  transfer- 
functions  tnenselves  PHO. 

4.  EXAMPLES  AND  PROBLEMS 


Our  second  example  is  a  counterexample  to 
the  effect  that  even  if  every  pair  of  plants  in  a  . 
set  is  simultaneously  stabilizable,  the  entire  set 
may  not  be.  To  this  end,  we  take  a  set  consisting 
of  three  plants  ipQ.p^ ,p2);  here  pQ  is  the  zero 

plant,  and 

P1  ■  nl/d1  ,  p2  •  nZ/d2. 

where  n^,  d^,  n2d^  and  d2  are  graphed  in  Figure  3. 

It  is  easy  to  see,  by  using  Youla's  criterion,  that 
end  p2  each  have  stable  stabilizing  compensators, 

and  by  using  the  criterion  in  section  3,  that  p^ 

end  p2  have  a  simultaneous  stabilizing  compensator. 

However,  there  is  no  stable  compensator  which  si»-. 
ulteneously  stabilizes  p,  end  n-.  If  there  were, 
then  by  the  criterion  of 'Theorem  2  there  would  be 
a  stable  transfer  function  f  such  that  both  d^  ♦ 

fn^  *  and  ■*  fn^  *  were  stable  and  minimum- 

phase.  A:  the  2ero$  of  n^,  d^  >0  and  so  r^  must 

be  positive  on  the  positive  real  axis.  Similarly 
r2  must  be  positive  on  the  positive  real  axis. 

However,  if  we  eliminate  f  we  see  that 
n2dl  ’  nld2  *  n2rl  '  nlr2 


The  only  case  in  which  the  geometric  results  8nl*  s0*  81  the  2eros  of  *  "t'V  wf  must  h*ve 
can  be  illustrated  on  paper  is  the  case  in  wr.ich  P  -  ,  n,r,.  Eut  at  these  points,  n,  >  0  and 
there  are  only  two  parameters.  For  this  reason  our  c  i  i  c  .  1 

first  example  will  deal  with  this  situation.  .  n2  *  D»  8n^  so  we  get  a  contradiction.  Thus  Pq.P^ 


Figure  1.  Feedback  Syste® 
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5.  CONCLUSIONS 

We  have  discussed  the  problem  of  finding  a 
compensator  which  stabilizes  every  plant  in  a  given 
set  of  plants.  In  the  abstract,  both  geometric 
and  analytic  criteria  have  been  given  for  the  ex¬ 
istence  of  such  a  compensator.  However,  only  in 
very  special  cases,  such  as  the  case  of  two'plants, 
can  these  criteria  be  checked.  Another  special 
case,  in  which  the  geometric  criterion  becomes 
particularly  clear,  is  the  case  of  first-order 
plants  with  proportional  controllers.  We  have  also 
given  an  example  where  pairwise  simultaneous  stab- 
llisability  of  a  set  of  plants  does  not  imply  over¬ 
all  simultaneous  stabili lability.  Finally,  we  have 
indicated  some  directions  for  further  research. 
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Abstract 

It  is  well  known  that  the  noise  processes  corrupting  an  image  are  in  general  signal- 
dependent.  An  interesting  aspect  of  signal-dependent  noise  is  that  there  is  a  certain 
amount  of  signal-information  embedded  in  the  noise.  Most  of  the  image  restoration  tech¬ 
niques,  however,  attempt  to  suppress  the  noise  terms  to  obtain  an  estimate  of  the  image 
and  do  not  exploit  the  additional  signal  information  contained  in  the  noise.  A  simple 
technique  designed  to  demonstrate  the  potential  for  signal  extraction  from  signal-dependent 
noise  is  presented  in  this  paper. 

Introduction 

An  interesting  aspect  of  signal-dependent  noise  sources  is  that  a  certain  amount  of  sig¬ 
nal  information  is  embedded  in  the  noise.1  Although  the  signal  dependence  of  the  noise 
renders  the  image  restoration  problem  more  complicated,  better  estimates  should  be  possible 
if  the  additional  information  contained  in  the  signal-dependent  noise  is  extracted  in  the 
estimation  process.  In  fact,  in  situations  where  the  noise  term  dominates  the  signal  term, 
the  signal  information  present  in  the  noise  term  may  be  larger  than  the  signal  term  itself. 
A  simple  technique  designed  to  recover  the  signal  from  signal-dependent  noise  is  presented 
in  this  paper. 

Signal  recovery  from  Gaussian  signal-dependent  noise 

Consider  the  image  formation  model1  given  by  the  following  equation: 

R  »  cS  +  kffSjNj  +  N2  .  (1) 

Here  R  is  the  noisy  measurement,  S  is  the  signal  to  be  estimated,  f (S)  is  a  function  of  the 
Signal,  Nj  and  N2  are  signal-independent,  statistically  independent  zero  mean  Gaussian  ran- 

2  2 

dom  processes  with  variances  and  c2  respectively,  and  c  and  k  are  scalar  constants.  In 
order  to  simulate  the  "worst  case"  situation,  we  let  c«0  and  k*l  in  Eq.  (1)  to  obtain 

R  =  f(S)Nx  +  N2  .  (2) 

j-  this  model,  all  the  signal  information  is  contained  in  the  signal-dependent  noise.  In 
this  situation,  estimation  of  the  signal  is  possible  only  by  exploiting  the  signal-depen¬ 
dence  cf  the  noise.  The  mean,  uR,  and  the  variance,  c2  ,  of  the  observation  R  are  given  by 


uR  »  EffCS)^  +  N2]  *  0  , 


c2  -  E[(R-ur>2] 

*  E[ (f (S))2]c2  +  0 2  .  (4> 

At  each  observation  point  we  may  treat  the  signal  S  as  an  unknown  constant.  Then  the  vari¬ 
ance  cf  ?.  conditioned  on  S  *  s  is 

o2  »  <f  (s) )  2o?  +  .  (5) 


Then  we  have 


.  <->  I  .E 


(6) 


Thus  when  the  function  describing  the  signal  dependence  has  an  inverse,  an  estimate  of  the 


signal,  s,  nay  be  obtained  by  estimating  the  variance  of  the  noisy  observation,  5 
and  a  knowledge  of  the  parameters  of  the  noise  processes.  In  particular  when 


R|S«s  ' 


where  p  is  a  known  non-zero  constant,  we  obtain 

.  _  I  ffR!s»s  "  a 


An  estimate  of  the  variance  of  the  noisy  observation  R  may  be  obtained  by  assuming  the  ran¬ 
dom  variable  to  be  ergodic  on  a  local  basis  and  using  a  spatial  window  of  size  M  x  M  cen¬ 
tered  around  the  pixel  being  estimated;  i.e.. 


where 


r  (i+k) , ( j+1) 


.  I  ttr 

m2  k  1  r (i+Jc) ,  (j+1) 


Here  the  subscripts  i,j  represent  the  coordinates  of  the  pixel  being  estimated,  r^  is 

the  noisy  observation  at  i,j  and  the  summation  is  carried  out  over  all  the  pixels  in  the 
estimation  window.  In  order  to  reduce  the  effect  of  spatial  smoothing  it  is  desirable  to 
keep  the  size  of  the  estimation  window  small.  A  noisy  image  obtained  when  the  observation 
model  of  Eq.  (2)  is  applied  to  the  original  image  of  Fig.  1,  with  f (s)  ■  is  shown  in 
Fig.  2.  A  constant  background  has  been  added  to  this  image  so  that  the  resulting  gray 
levels  are  all  positive.  (This  constant  background,  however,  will  not  affect  the  estimate 
obtained  as  the  estimate  is  based  on  the  variance.)  The  image  extracted  from  this  noisy 
observation  using  the  estimation  technique  described  in  this  section  is  shown  in  Fig.  3. 
This  restored  image  indicates  that  the  estimator  is  successful  in  recovering  the  gross 
structure  of  the  original  image. 

The  estimation  technique  presented  above  is  not  limited  to  the  "worst  case"  imaging 
situation  discussed  here,  and  has  been  extended  to  situations  when  the  signal  embedded  in 
the  noise  is  only  a  part  of  the  total  signal  information.1 

Signal  recovery  from  Poisson  noise 

Images  formed  at  low  light  levels  are  corrupted  by  noise  associated  with  the  discrete 
nature  of  the  photon  counting  process.  Such  images  are  well  modeled*  by  the  equation, 

R  -  \  P[XS]  ,  (11, 

where  P[XS]  represents  a  Poisson  process  with  >.S  as  the  parameter,  and  R  and  S  are  the 
r.oisy  observation  and  the  original  signal  respectively.  The  scaling  factor  i  is  used 

to  make  the  mean  of  the  measurement  R  and  the  mean  of  the  signal  S  equal  to  each  other. 

:.*ote  that  in  this  model  the  noise  is  inherently  signal-dependent.  A  noisy  image  obtained 
from  the  original  image  of  Fig.  1  using  this  model  is  shown  in  Fig.  4.  When  an  estimate 
of  the  signal  is  subtracted  from  the  noisy  observation,  the  residual  noisy  image  still 
contains  signal  information.  As  an  example,  consider  the  difference  im.ace,  Rq,  obtained 

by  subtracting  the  original  signal  S  from  the  noisy  observation  R  of  Eq.  (li).  This 
difference  signal  is  given  by 


\  P[XSj  -  s 


The  mean  and  variance  of  R^  are  given  by 


*  E[J?[XS]  -  S]  -  0 
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Figure  1.  The  original  image. 


Figure  3.  The  ir.age  recovered  from  Fig.  2. 


Figure  2.  The  noisy  image  with  Gaussian 

noise,  with  <^=4.0  and  o2=*20.0. 


Figure  4.  The  noisy  ir.age  with  Poisson 
noise,  with  >.  =  0.05. 
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(14) 


\  *  eC(^Cis]-s-uRd)23  . 


As  before,  if  we  treat  S  as  an  unknown  constant  equal  to  s,  we  obtain 

flRD|S-«“s/X  (15) 

Thus  an  estimate  of  the  signal,  s,  may  be  obtained  by  using  the  relationship 

*  -  lo2  ,  (16) 

*D 

where  3^  is  an  estimate  of  the  local  variance  of  the  difference  image  obtained  using  RQ  for 

R  inEq.(9).  The  residual  image  obtained  from  the  noisy  observation  of  Fig.  4  is  shown  in 
Fig.  5.  The  estimate  obtained  from  this  difference  image  is  shown  in  Fig.  6.  This  restored 
image  demonstrates  the  ability  of  the  estimator  to  extract  the  gross  structure  of  the  orig¬ 
inal  image  by  making  use  of  the  signal  information  contained  in  the  signal-dependent  noise. 

Signal  Estimation  from  Multiple  Observations 


'  In  some  instances,  such  as  television  or  movie  imaging,  multiple  looks  of  a  given  object 
or  scene  are  available.  In  such  cases  we  may  estimate  the  object  over  an  ensemble  of  sample 
functions,  instead  of  estimating  from  a  single  observation.  Such  an  estimate  does  not  suf¬ 
fer  from  the  spatial  smoothing  effects  that  are  present  in  many  of  the  image  estimation 
techniques.  As  a  result  one  may  expect  a  higher  spatial  resolution  in  restored  images. 
Further,  when  the  noise  corrupting  the  signal  contains  a  signal-dependent  part,  it  is  pos¬ 
sible  to  obtain  a  second  estimate  by  recovering  the  signal  from  the  signal-dependent  noise 
term.  These  issues  are  discussed  in  this  section. 

Consider  an  ensemble  of  noisy  observations  {R^;  i»l,2...M)  obtained  when  a  signal  S  is 
degraded  by  a  set  of  statistically  independent  noise  terms  N11  and  N2i,  with 


S+f  (S)Mu+»2i, 


i-1,2, . . .M. 


If  the  noise  processes  are  modeled  as  zero  mean  processes,  then  an  estimate  of  the  signal, 
§,  may  be  obtained  by  finding  the  sample  mean  over  the  ensemble  at  each  pixel,  using  the 
equation 


.  1  " 

5  •  s  I  R,  .  (18) 

M  i-1  1 

When  such  an  estimate  is  subtracted  from  each  of  the  noisy  observations,  the  residual  images, 
R^,  consist  mostly  of  noise  terms  as  given  by 

RDi  *  f(S)Nli  +  N2i  •  (19) 

Csing  the  procedure  described  earlier,  an  estimate,  §N,  may  be  obtained  from  these  residual 

images.  Thus,  if  c\  represents  an  estimate  of  the  variance  of  the  residual  images  at  a 

V  , 

given  pixel,  and  if  and  Oj  -are  the  variances  of  the  noise  terms  and  N2  respectively, 
then  an  estimate  is  given  by  _ 


*k-  *  L  * 


-r*w. 
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Sots  that  the  variables  in  Ecs.  (17)- (21)  represent  the  corresponding  quantities  at  each 
pixel  location  and  the  sane  estimation  procedure  mu3t  be  applied  to  each  pixel  to  restore 
the  entire  image.  To  evaluate  this  estimation  process,  independent  Gaussian  noise  samples 
were  added  to  the  original  image  of  Fig.  7a,  resulting  in  the  four  noisy  images  shown  "in 
Figs.  7e-h.  The  restored  image  obtained  by  finding  the  sample  mean  at  each  pixel  is  shown 
in  Fig.  7b.  The  residual  images  obtained  by  subtracting  the  image  of  Fig.  7b  from  each  of 
the  r.oisy  observations  are  shown  in  Figs.  7i-l  (as  before  a  constant  background  has  been 
added  to  these  images) .  The  restored  image  obtained  when  Eq.  (20)  is  applied  to  these 
images  is  shown  in  Fig.  7c.  Note  the  remarkable  detail  visible  in  this  restored  image  even 
when  the  sample  size  is  as  low  as  four.  The  signal  recovered  from  each  of  the  difference 
images  shown  in  Figs.  7i,j,k  using  the  spatial  signal  recovery  techniques  described  earlier 
are  shown  in  Figs.  7m,n,o.  The  estimate  shown  in  Fig.  7p  is  the  pixel-by-pixel  mean  of 
the  estimates  shown  in  Figs.  7m,n,o.  The  signal  information  may  also  be  recovered  using 
higher  order  moments.  As  an  example,  the  restored  image  obtained  by  estimating  the  fourth 
moment  of  the  difference  images  and  other  appropriate  equations  is  shown  in  Fig.  7d.  As 
expected,  the  use  of  multiple  frame  observations  has  completely  eliminated  the  spatial 
smoothing  problem  encountered  in  single  frame  estimators.  Further,  these  simulations  have 
indicated  that  excellent  noise  suppression  and  the  recovery  of  good  quality  images  is 
possible  even  when  the  sample  size  is  small. 

Conclusions 

The  potential  advantages  of  the  procedures  described  for  recovering  a  signal  from  sig¬ 
nal-dependent  noise  have  been  discussed  in  this  paper.  It  has. been  shown  that  it  is  pos¬ 
sible  to  obtain  a  fairly  good  estimate  of  the  signal  even  when  all  the  signal  information 
present  in  the  observation  is  in  the  form  of  signal-dependent  noise.  Modifications  to  the 
estimators  in  order  to  recover  a  signal  from  multiple  frame  observations  have  also  been 
presented.  Although  the  techniques  described  are  applicable  to  only  Gaussian  signal-de¬ 
pendent  noise  and  Poisson  noise,  it  is  possible  to  extend  the  same  principles  tc  other 
types  of  signal-dependent  noise,  such  as  magnetic  tape  noise  and  reverberation  noise. 
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Texts  on  classical  partial  differential  equations  (e.g.,  Gara- 
bedian  [_6,p.57]  and  Courant  and  Hilbert  [2  ,p.l55] )  always  develop 
the  well-known  theory  of  canonical  forms  for  second  order  linear 
partial  differential  equations  in  two  variables.  This  micro-local 
result  gives  necessary  and  lufficient  conditions  for  the  reduction 
of  the  top  order  part  of  the  equation  to  constant  coefficients. 

The  purpose  of  this  paper  is  to  investigate  two  macro-local  prob¬ 
lems  on  the  reduction  of  the  top  order  part  of  a  linear  partial 
differential  operator  to  constant  coefficients. 

Section  1  contains  the  preliminaries  and  leads  to  statements 
of  the  two  problems  on  the  geometry  of  linear  partial  differential 
operators.  The  general  results  concerning  second  order  linear 
partial  differential  operators  are  given  in  Section  2.  Section  3 
is  devoted  to  some  consequences  of  the  theorem  of  Cotton,  while 
Section  4  contains  some  miscellaneous  remarks. 
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Section  1:  Preliminaries 

Lac  M  be  an  n-d  loans  tonal  smooch  manifold  and  lac  FCM.pjR01) 

dano'ca  cha  sac  of  smooch  maps  from  M  co  R*  which  arc  defined  in  a 

neighborhood  of  pcM.  If  (x,U)  is  a  chare  of  M,  peU,  and 

a«(a, )  is  an  a- tuple  of  nonnegseive  integers,  va  define 
l  n 

3f/3xa(p)  for  each  f cF(M,p;R)  by 

(1.1)  3f/3xa(p)  -  Da(f*x'1)(x(p)). 

where  D°  denotes  the  usual  partial  derivative  in  Rn.  If 
3«(0, •  •  •  ,0) ,  then  Dag**g.  Define  |a|«a,+-*,a  and  a!»a, !•••(»  !. 

1  Q  X  ft 

For  each  nonnegaeive  integer  r,  we  say  chat  f,  geF(M,p;R“)  are 
r-jec  equivalent  if  for  some  chart  (x.U)  of  M  for  which  pcU  we 

have 

(1.2)  3fl/3x°(p)  -  3g1/3xa(p)  fflr  i»l,***,m  and  |<x|<r. 

It  follows  from  che  chain  rule  that  (1.2)  is  independent  of  the 
chart.  The  r-jet  equivalence  class  of  fcFOi.pjR”)  is  denoted  by 
Jrf(p)  and  che  set  of  all  r-Jec  equivalence  classes  in  F(M,p;RB) 
is  denoted  by  Jr(M,p;RB). 

Jhe  set  Jr(M,p;R*)  has  a  natural  vector  space  structure  inher¬ 
ited  from  F(M,p;RB)  and  Jr(M;Ra)  -  U  Jr(M,p;Ra)  has  a  natural 

pcH 

manifold  structure  which  makes  Jr(M;R°)  with  its  natural  projection 
onto  M  into  a  vector  bundle.  We  also  consider  jet  bundles  with  a 
fixed  target  by  defining 


Jr(M,p;Ra,0)  -  {jrf(p)|feF(M,p;R®)  and  f(p)-O). 
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Proceeding  as  above,  we  obtain  the  vector  bundle  Jr(M;Rm,0)  over  M. 

The  case  tv*l  is  of  special  interest.  The  dual  bundle  of 
Jr(M;R,0)  is  the  r-th  order  tangent  bundle  Tr(M)  of  Pohl  [2] .  Sec¬ 
tions  of  Tr(M)  are  r-th  order  partial  differential  operators  on  M 
(without  constant  term).  In  fact,  charts  of  M  induce  local  bases 
of  sections  for  Jr(M;R,0)  and  Tr(M)  in  a  straightforward  way.  If 

(x,U)  is  a  chart  of  M  and  if  x°”x,al...x  an,  then  the  maps  jrxa 

1  n 

for  0<|a|<r,  defined  by 

jrx“(p)  -  jr((x-x(p))a)  (p)  for  ptU, 

form  the  basis  of  sections  of  Jr(M;R,0)  over  C.  The  dual  basis  of 
sections  of  Tr(M)  over  U  is 

{(l/a!)3/3xa|0<|a|<r). 

In  a  similar  fashion,  the  dual  bundle  of  Jr(M;R)  is  Tr(M)#ll, 
where  A  denotes  the  one-dimensional  trivial  bundle  over  M.  Sec¬ 
tions  of  Tr(M)Ml  are  r-th  order  partial  differential  operators  on 
M  with  constant  term. 

We  will  now  show  that  charts  of  M  are  equivalent  to  the  exist¬ 
ence  of  integrable,  local  sections  of  a  certain  fibre  bundle.  Let 
RF(M,p;Rn)  denote  the  subset  of  feF(N,p;Rn)  which  are  regular  at 
p,  i.e.  for  which  the  n*n  Jacobian  matrix  of  f  relative  to  some 
(and  hence  any)  chart  of  M  near  p  is  nonsingular  at  p.  Let 

RJr(M,p;Rn,0)  -  { jrf (p) | ftRF(M,p;Rn)  and  f(p)-0}. 

The  set  RJr(M;Rn,0)  *  U  RJr(M,p;Rn,0)  has  a  natural  manifold 

pcM 


277 


FREDRICKS 


T  Hi 

structure  which  sakes  RJ  (M;R‘  ,C)  with  its  natural  projection  onto 
M  into  a  fibre  bundle.  A  section  s  of  RJr(M;Rn,0)  is  integrable 
if  there  exists  a  smooth  asp  f:  J**Ra  which  is  regular  on  M  such 
that 

s(p)  *  jr(f-f(p)) (p)  for  each  peM. 

U'e  will  denote  such  a  section  simply  by  jrf. 

Note  that  if  (x,U)  is  a  chart  of  M,  then  jrx  is  an  integrable 
section  of  RJr(M;Rn,0)  over  U.  In  fact,  if  one  identifies  charts 
(x,U)  and  (y,V)  for  which  U«V  and  x-y  is  a  translation  up  to  order 
r,  then  there  is  a  natural  one-to-one  correspondence  between  charts 
of  M  and  integrable,  local  sections  of  RJr(M;Rn,0). 

We  have  already  seen  how  a  chart  (x,U)  of  M,  i.e.  an  Integrable 
section  jrx  of  RJr(M;Rn,0)  over  U,  induces  a  local  basis 
{3/3xa|0<|a|<r}  of  sections  of  Tr(M)  over  II .  These  local  bases  are 
called .coordinate  frames  for  Tr(M)  over  U. 

Suppose  now  that  s  is  a  section  of  RJC(M;Rn,0)  over  U.  For 
each  pell  and  each  a  with  0<|a|<r,  define 

(1.3)  Xa(p)  -  3/3xa(p)  if  s(p)-jrx(p). 

The  saps  {Xal0<|a(<r}  fora  a  local  basis  of  sections  of  Tr(M)  over 
V.  Local  bases  which  are  induced  from  sections  of  Tr(M)  over  U  in 
this  manner  are  called  generalised  coordinate  frames  for  Tr (M)  over 
U .  Rote  that  generalized  coordinate  frames  are  Induced  by  smoothly 
varying  geras  of  coordinates. 

Coordinate  and  generalized  coordinate  frames  for  T* (M)»U, 
Jr(M;R,0)  and  JC(M;R)  can  be  defined  in  the  obvious  ways.  Because 
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of  our  interest  in  partial  differential  operators,  we  will  only  con¬ 
sider  these  frames  for  Tr(M). 

Since  the  geometry  of  partial  differential  operators  is  in 
the  top  order  symbol,  we  can  now  state  the  two  main  problems  on 
"reduction  to  constant  coefficients"  in  the  top  order  part.  Let  P 
be  a  section  of  Tr(M)  and  let  U  be  an  open  subset  of  M. 

Problem  1:  Find  necessary  and  sufficient  conditions  on  P  and  U 
for  the  existence  of  a  coordinate  frame  (3/3xa)  for  Tr(M)  over  U 
and  b^eR  for  which 

P  -  I  b  3/3x“  +•••  on  U. 

The  symbolism  "+•••"  means  plus  lower  order  terms. 

Problem  2:  Find  necessary  and  sufficient  conditions  on  P  and 
U  for  the  existence  of  _a  generalized  coordinate  frame  {Xq]  for 
Tr(M)  over  U  and  bacR  for  which 

P  -  I  fc  X  +•••  on  U. 

M-r  “ 

Mote  chat  the  order  of  X  is  defined  in  the  classical  sense.  Thus 

a 

we  see  that  the  order  of  X  is  I  a  I ,  but  that  X  may  also  include 

a  1  '  a 

lower  order  terms.- 

Since  these  problems  are  macro- local  in  nature,  we  will  con¬ 
sider  M  to  be  an  open  subset  U  of  Rn.  The  following  consequence 
of  the  chain  rule  is  useful  in  translating  these  problems  tc  prob¬ 
lems  in  linear  algebra  and  differential  geometry.  If  {X^}  is  a 
generalized  coordinate  frame  for  Tr(U)  induced  by  a  section  s  of 
RJr(U;Rn,0),  then  for  each  pcM  and  0< I  a | <r  we  have 
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03(p)  -  Z  -37  D1(x=)(p)X.(p), 

0<!£|<Jal 

where  *(p)«Jrx(p) .  Note  that  equality  also  holds  in  th«  preceding 
line  if  one  susis  over  the  indices  0<|Bli,f-  Thus,  if  P  is  a  section 
of  Tr(l"),  say 

P  -  Z  a  D* 

0<|a|^.r  3 

with  a  :  U-*R  smooth,  then 

SI 

(1.4)  P  -  I  (  Z 

0< l_S | *x  \o<l®l<r  -  nl  B 

where  g  :  lf-*R  is  defined  by 

gafl(p)  -  -gf  Da(x6)(p)  if  s(p)-3rx(p). 

Section  2?  General  results 

It  is  convenient  to  use  a  different  type  of  notation  while  con¬ 
sidering  Problems  1  and  2  for  second  order  linear  partial  differen¬ 
tial  operators  on  an  open  subset  U  of  Rn.  Let  denote  the 

standard  unit  vectors  in  R°  and  define 

*i  •i+*J 

-  Dj“D  .  and  D^"D  J  for  i,J"l,...,n. 

In  a  similar,  fashion,  we  define  and  3/3*^  for  a  coordinate 

frame  {3/3x°}  for  T2(U) ,  and  Xj^  and  X^  for  a  generalixed  coordinate 
frame  {X  }  for  T2(U). 

Suppose  now  that  P  is  a  second  order  linear  partial  differencial 
operator  on  U.  Then  P  can  be  written  uniquely  in  the  fora 

(2-x)  p’  A  Vu  +  ;  *iDi  d. 


on  U, 


*  1  X.  on  B. 
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where  are  smooth  functions  and  a^-a^.  It  follows  from 

(1.4)  that  if  {o/3xa}  is  a  coordinate  frame  for  T^(U)  and 

p-  k;t  bki  3/3*ki  +  l  \  3/3*k  SB.  0, 

wlth  bki  ■  blk'  then 

Thus  the  second  order  coefficients  of  P  transform  tensorially 
according  to  the  matrix  equation 

Dx  A  tDx  “  B  on  U, 


*ijDiXkDjXi  —  U  —  k*4-1* 


where  A^  ^-a^,  and  Dx  is  the  Jacobian  matrix  of  the 

coordinates  x.  Given  the  sysmetric  matrix  A  with  smooth  entries, 
this  equation  is  solvable  for  coordinates  x  on  U  and  for  a  constant 
matrix  5  if  and  only  if  it  is  solvable  for  coordinates  x  on  U  and 
a  constant  matrix  of  the  form 

B  -  diag  (1, •••,1,-1, •••,-!, 0,*"C), 

?»M 

with  p  l's  and  q  -l's,  where  p  and  q  are  nonnegative  integers  with 
p+q<n. 

■Thus  we  have 

Problem  1  (r"2):  Find  necessary  and  sufficient  conditions  on  P  in 
(2.1)  and  U  for  the  existence  of  coordinates  x  on  U  and  nonnegative 
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If  \Xa)  is  a  generalized  coordinate  frame  for  T  (U) ,  then  we  obtain 
the  relationship  (2.2)  where  the  coordinates  x  are  germs  of-coordi- 
nates  which  vary  smoothly  on  U.  Thus  we  have 


Problem  2  (r»l) :  Find  necessary  and  sufficient  conditions  on  P  in 


(2.1)  and  U  for  the  existence  of  t  smooth  map  G:  U-*CL(n)  and  non¬ 


negative  integers  p  and  q  for  which 


(2.3)  G  A  CG  «  3  on  U. 

P.<J  — 

Note,  from  Sylvester's  Theorem,  chat  a  necessary  condition  for 
solving  (2.3)  is  that  A  has  p  positive  and  q  negative  eigenvalues 
at  each  point  of  U.  Under  this  assumption  we  have  (see  (S)  for 
more  details)  that 


E  -  {(u.g)eUxCL(n)  |  g  A(u)  rg  -  3,,  } 


is  a  submanifold  of  U*GL(n)  and  that  (E,»,U),  with  *  denoting  the 
obvious  projection  of  E  onto  U,  is  a  fibre  bundle.  Since  the  de¬ 
sired  map  G  is  a  section  of  this  fibre  bundle,  we  can  guarantee 
the  existence  of  such  a  map  by  assuming,  for  example,  that  U  is 
smoothly  contractible  (see  Steenrod  [9.P-531).  Hence  we  have 
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if  and  only  if  A  has  p  positive  and  q  negative  eigenvalues  at  each 
point'  of  C. 

To  solve  (2.2)  we  need  a  smooth  map  G:  U-CL(n)  which  satisfies 
(2.3)  and  for  which  there  exists  a  solution  to  the  system 

D.x.  »  g..  on  U  for  i,j»l,...,n, 
j  x  —  -  J  ’ 


where  G..  _,.“g...  From  basic  existence  and  uniqueness  theorems 
(i.j)  if) 

for  systems  of  first  order  partial  differential  equations  we  see 
that  local  solutions  exist  if  and  only  if 


DkSi;j  *  Dj8ik  SB  u  ISl  i.J.k-1 . n. 

If  the  matrix  A  is  nonsingular  on  U,  i.e.  ,  P  is  nondegenerate  on 
U,  then  the  preceding  "integrability  conditions"  can  be  written  as 
curvature  conditions  on  A.  (See  Spivak  (8,  ch.4D]  for  more  de¬ 
tails.)  Recall  that  the  components  R^^  of  the  Riemann  curvature 
tensor  corresponding  to  A  are  defined  for  i, j ,k,£*l, . . . ,n  by 


RijkJ."  (DjkaU+Ditajk-Dj  iaik‘Dikaj  i5  /2 


+  gZt  ast(  [ii,s]  [jk,t]-[ik,s]  [jt,t])  , 


where  a“J<j}  -  a.,  and  [ij  .^-(D^^a.^D^) /2. 

As  in  the  macro-local  theory  of  canonical  forms  for  second 
order  linear  partial  differential  equations  (see  Finn  [£)),  the 
"integrability  conditions"  are  sufficient  only  on  certain  domains. 
The  Poincare  lemma  with  compact  supports  can  now  be  used  to  obtain 
the  following  macro-local  version  of  a  theorem  due  to  Cotton  []j  . 
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Thooraa  2.3;  Lft  ubi  i  snoot hi v  contractible  ogw  tub— c  of  Rn 
fld  lot  P.  ilwon  fa  (2.1)  Ofi  6.  bo  nondosonoroto  on  D.  For  — ch 
ooon  anboat  0  of  1°  rich  5  connect  oad  contained  In  D  thorn  diti 
£  coordinate  frana  {a/ta*>  for  T2 (D)  for  which 
P  ® 

t  -  t  a/a*..  -  :  a/a*..  ♦  ...  on  o 

1-1  u  1-7+1  33 

If  end  only  If  A  boo  »  poaltlwe  olgonralees  «  agg  gojac  (and  honco 
•11  Points)  of  0  on d  tho  connononto  R^j^  of  Jjam  Rlonww  curvature 
Connor  corresponding  to  A  wnlih  on  fi. 


Soctlon  2:  Conooouoncoo  of  tho  thoornn  of  Cotton 

Ho  will  man  throughout  this  Motion  that  8  and  U  are  open 

aubaaca  of  ta  with  tho  proportion  otncod  la  Iheoran  2.5.  Koto  (nee 

Iloonhnrt  (2.  p.21])  choc,  dno  to  vorlouo  rolatlonohlpo  bocwoon 

2  2 

tho  cooponontn  lyU,  chore  are  n  (n  -1)/12  eondltloao  la  tho  ^ 
vanishing  of  tho  Hannon  enrvntnro  toanor  corresponding  to  A. 

Ho  now  begin  •  a tody  of  tho  caoo  n-2,  whore  wo  hove  a  single 

i 

wsalahlag  condition  on  tho  ewrrotnro,  noasly,  on  U.  Con¬ 

sider 


(3.1)  P  •  iD^+abDj^+cD^+dDj+eDj  on  6, 

whore  o,b,e,d,o:  9*1  ore  naooth,  and  asstas  that  ?  la  aoodegonoroto 
on  6.  Ho  hove 


a  b 

b  e 


and  wo  lot  A 


o '  B 
t  T 


A  ealeulatlcn  shows  chat  1^221  vaa^4h*c  on  E  if  only  If 
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(32)  2Yu-4S12+2«22+.(2«1B2-fllYl-.22)+c(2eiY2-«2Y2-Y12) 

4b(a1Y2-«2Yx+ft*i82“28lYl“2a282^"0  —  ®» 

where  we  have  used  f t  end  f  to  denote  D^f  end  D^f ,  respectively. 
Examination  of  the  eigenvalue  condition  now  gives  the  following 
consequence  of  Theoreu  2.5. 


Theoren  3.3:  For  P  slven  In  (3.1),  there  exist  coordinates  (x,y) 
on  U  for  which 

(a)  P  -  8/a*2  ♦  3/3y2  +•••  on  Uj 

(b)  P  «  -a/a*2  -  a/ay2  +• ■ •  on  0; 

(c)  p-  a/a*2  -  a/ay2  ♦•••  ^u; 

If  and  only  If  (3.2)  holds  and,  respectively . 

(a)  det  A  >  0  on  0  and  a  >  0  on  6: 

(b)  det  A  >  0  on  0  and  a  <  0  on  8; 

(c)  det  A  <  0  on  P. 

tie  remark  that,  although  condition  (3.2)  Is  rathsr  long.  It 
Is  generally  much  easier  to  check  this  condition  In  a  specific 
exasple  than  It  la  to  try  to  find  coordinates  which  nay  or  nay  not 
exist.  This  remains  true  even  for  aord  variables,  where  we  may 
have  a  larga  number  of  conditions  similar  to  (3.2). 

Instead  of  using  Theorem  3.3  to  obtain  results  about  specific 
operators  P,  we  will  assume  that  P  has  a  special  form  and  then 
apply  Theorem  3.3  in  that  situation.  For  example,  letting  agn  de¬ 
note  the  sign  function,  we  have 
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\  sV 
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Corollary  3.»:  ll_  ••  on  V  with  a  and  c  aonvanlahlng 

on  1‘,  eh  as  Chart  txlat  coordlnacta  (x.y)  on  U  for  which 


t  »  «r»(a)S/>j  -  ♦  agn(e)J/»y2-f--  on  0 


itiaiaalxM 

3a3c12+3c3a22-2a3ccu-2ae3a23-a2ca1c1-ac2a2c2-0  on  5. 

Cotollarr  3.3i  *aal>u+2bDu+‘  •  •  on  0  with  b  oonranlahlna  gn  0, 
than  chara  anile  coordlnataa  (x.y)  on  0  for  which 

t  -  »/3x2-»/»y2  ♦•••  gn  0. 


if  and  only  if 

h2«u-2abh^1-2b2bua>»blb2-3b«1b1-Kabl2-0  gg  3. 

Zootoad  of  acaciog  alailar  raanlca  for  auiaai  form  of  opar- 
ocora  t,  m  will  coacaacraca  on  alcwaciona  whara  eba  gaaaral  no- 
lucion  co  cha  aoaliaaar  partial  diffara&cial  aqnadoa  (3.2)  can  ba 
found.  For  axaepla.  locclag  wa  hm 

Corollary  3.4:  ^  t» a*2+’ •  •  on  5  with  a  nonyMi-M^g  gg  0,  chan 
cfeaKt  adit  coordinataa  (x.y)  an  C  for  which 

t  •  agn(a)(»/»x2a>»/jy2)+. .. 

II  12i  only  II  ?2(in|a|)  ■  0  j£D. 


Tha  proof  aataila  lacciag  a*c  in  Corollary  3.4  and  noting  chat 

2*3<«*u-4l2+M22-a22>  *  2aV(m|a|). 
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Sine*  every  bounded  harnonlc  function  on  R2  i s  constant,  we 
obtain  the  following  result. 

Corollary  3.7:  If  P»aV2  +• * •  on  R2  with  a  nonvanlahlns  and  bounded 
on  R2,  then  there  do  not  exist  coordinates  (x,p)  on  R2  in  which  the 
top  order  part  of  P  has  constant  coefficients  unless  a  Itself  is 
constant. 

A  result  similar  to  Corollary  3.6  can  be  proven  forQ-D^-Djj 
by  noting  that 

2a3(aau-a12-aa22+a22)  •  2aSQ(in|a|) 
and  then  using  Corollary  3. A  to  get 

Corollary  3.8:  If  P-alH  +•••  on  U  with  a  nonvanlshlni  on  U.  then 
there  exist  coordinates  (x.y)  on  U  Jo£  which 

P  •  »/»x2-*/Sy2+-  •  • 23.  0. 

if  and  only  if  ri(lnlal)H)  on  0,  i.e. ,  if  and  only  if  there  exist 
snooth  functions  f  and  g  for  which 

tn|a(u,v)|  ■  f(v*v)  +g(u-v)  for  each  (u,v)cU. 

Her*  we  us*  (u,v)  to  denote  the  canonical  coordinates  on  R2.  A 
simple  consequence  of  Theoren  3.3  (or  Corollary  3.3)  is  the 

o 

Corollary:  If  P-aD11+2D12+*  • •  on  V,  then  there  exist  coordinates 
(x,y)  on  0  for  which 

P  -  »/»x2-8/8y2+*-*  on  0, 
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If  tad  only  if  thsra  axlst  »  sooth  function*  f  and  g  for  which 
a(u,v)  -  f(v)u  +g(v)  for  jgfih.  (u,v)*6. 

Dm  mxi  malt  follows  froa  Thooroa  3.1  after  noting  chat 

“l2'blb2  “  <»«|k|)12A2. 

Corollary:  If  yfbP^*---  on  0  with  »  nonvaalshlos  on  5.  than 
thoro  salat  coordinates  (x,y)  on  0  fj£  which 

r  •  a/»*2-»/»yV-*  oa  o. 

If  sod  only  it  thsra  aalst  snooth  functions  f  |  for  which 

b(u,y) f(w)g(y)  for  aach  (n,y) tij. 

Oa  eastloo  ana  flaal  cosaogwanca  of  thaoraa  3.3  bacawaa  of  lea 
ralatioasblp  to  Laplaca'a  oparstor  Is  polar  coordlsatas. 

Corollary:  y*°u‘feD22‘*"  ‘ '  *lttt  c  — ' fit  aai  it" 

osndlss  oslr  oo  w.  Chaa  Chars  sxHt  coordlaatas  (*.y)  0  itt 

AM 

T  •  8/lx2  ♦  sgn(g)8/8y2o**-  SA  ®» 

lai  aals  1 t  «•  MUt  <»■  a  asbisbi*  M  At 

Ism 

o 

c(u)  ■  (u-t)”2  sons  ecu. 

Va  now  consider  soaa  higher  dlsena tonal  raaulta.  Slaca  tha 
maker  of  conditions  for  tha  vanishing  of  tha  consonants 
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lnertUM  rapidly  as  a  increases,  results  similar  to  those  already 
given  in  this  section  become  harder  to  obtain  as  n  increases. 

To  get  an  idea  of  the  conditions  Involved  in  the  81— arm  tensor 
ve  will  state  only  one  result  in  three  dimensions. 

Theorem;  If  P*aD2j+bD22+cD 33+* •  •  on  0  with  a,b,c  nonvan iehine  on 
0,  then  there  exist  coordinates  (x.y.z)  on  U  for  which 

P  *  sgn(a)3/3x2+sgn(b)3/3y24*gn(c)a/at2+“*  on  0, 

if  and  only  if  the  followins  hold  on  U: 

a2ba1b1+ab2a2b2-3b3a22+2ab3a22-3a3b12+2a3bb11-abca3b3-0 

a2ca1b1+ac2a3c3-3c3a32+2ac3a33-3a3e12+2a3ecll-abca2c2-0 

b2cb2c2-tbc2b3c3-3c3b32+2bc3b33-3b3c22+2b3cc22-abcb1c1-0 

2abca23+aba3c2+aca2b3-3bca2a3«0 

2abcb23+abb3c2+bca3b1-3acbjLb3*0 

2abcc12+acb1c2+bca2c1-3abclc2-0. 

Since  we  will  now  turn  to  results  which  are  true  in  n  dimensions, 
we  will  temporarily  cease  using  subscripts  to  denote  partial  deri¬ 
vatives  . 

Let  u, ,...,u  denote  the  canonical  coordinates  of  Rn.  Rote 
x  n 

that  if  n>2  and  A“diag{a^, • • • ,an)  with  each  aa  a  function  of  u^ 

* 

alone,  then  (lj,k]"0  for  i^J  or  idk  and  fil.i]  •  (l/2aj)  . 


FREDRICKS 


Ve  now  in  Clue  *1^*0  *OT  *H  l.J.k.i.  and  hence  hava  tha  following 
consequence  of  Thaoran  2.3* 

Corollary:  tot  * '♦*aDBB**  *  *  ®£  5  with  a1 , . . .  ,«B  oon- 

raalshlna  on  0  and  each  aa  a  function  of  «a  alona.  If  p  la  tha 
ayg£2£  of  m  for  which  aa>0  on  6,  than  thara  exists  a  coordinate 
fraaa  fo£  T^U)  f£T  which 

P  n 

F  -  I  a/8*..  -  I  a/8*.>**‘  on  0. 

i«l  J-p+I  33 

In  Corollaries  3.6  and  3.S  wa  considered  P-aT^ • •  and 
P»a  C+*  •  • ,  respectively.  Tha  corresponding  cases  in  higher  di- 
nans  ions  ara  covered  in 

»»2  MS*  P  H  ftoad  »£&  Ofp*n.  i££ 

*x.'**’%  **li  I£ 

P«a(J1Djj+‘  •  ■+^n*}BB)+' '  *  jBft  5  with  a  nonvaniahinn  op  6,  than  chare 
exists  £  coordinate  frsna  { J/8x°}  for  T*(B)  for  which 

P  •  s*n(a)(d1a/»*li+***+<a»/a*aa)+'*'*  on  U, 

jf  and  only  If  |a|  has  one  o£  tjia  followini  foras  on  S: 

r  [  Ji  6*<v‘«>2  ]  2  “  [  W  ] *• 

* 

whara  r,itl,  t-ft^ . tB)t  Ra  jg£  q*(qlt...,qB)«Ra  which  satisfies 

a  2 

*  «»<  "  •• 
o-l 

Sots  that  the  conditions  in  tha  cases  p*n  and  ?"0  ara  that  a 
is  constant  or  has  tha  fora  r | u—  1 1  on  U,  whara  rcR  and  tfC. 
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2 

Letting  7  *dii+*"+cbb»  u*  DOW  have  the  following  higher  diaen- 
eionel  analog  of  Corollary  3.7. 

Corollary:  If  P»aV2+* • •  on  Rn  with  n>2  and  a  nonvaniahlnt  on  Rn, 
then  there  does  not  exist  a  coordinate  fraae  for  T^R®)  in 

which  the  top  order  part  of  P  has  conatant  coefficlente  unleaa  a 
itself  is  constant. 

Also  note  that  if  X  denotes  the  Lorentslan  distance  function, 
then  the  condition  In  the  case  p“n-l  of  Theorea  3.9  is  chat  a  la 
of  the  fora  [«q,u>+sj2,  where  scR  and  q  Is  in  the  light  cone 
centered  at  the  origin,  or  a  la  of  the  fora  r(X(u,t)]\  where  rcR 
and  the  light  cone  centered  at  t  does  not  asst  U. 

We  conclude  this  section  with  Che 

Proof  of  Theorea  3.9:  Since  A-a  dlag  {«1,...,dji)  and 

A-1-  dlag  {£^ . «n}/a.  It  follows  that  (lj,k]»0  If  l,j,k  are 

distinct.  One  can  now  easily  shew  that  *^^"0  if  l,j,k,l  are 
distinct.  Returning  to  the  use  of  subscripts  to  denote  partial 
differentiation,  direct  calculations  show  that 

’ ,V.2  a  m 

0*1 

and 

**  *ijit"*l^2**jl"*j*t^  —  ate  distinct. 

Due  to  the  various  relationships  between  the  conponents  R^^  we 
see  that  the  curvature  conditions  in  Theorea  2.5  for  A  are 
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-0 

on  6  for  1<J . 

-0 

l 

W«  aov  Him  due  a>0  os  0  and  noco,  la  ehis  cbm,  due  (3.11)  is 
equivalent  to  (a1^2)^"©  on  6.  Siaes  this  holds  for  all  l<j,  it 
follows  by  integration  that  (3.11)  holds  for  a>0  if  and  only  if 
ebaro  exist  saooch  functions  fB  of  u^  alone  such  chat 

*  2  * 
a(u)  •It  f.(u.)i  on  0. 

a>l  *  * 

A  calculaeloa  bow  shows  due  a  function  of  this  fora  sacisfles 
(3.10)  if  and  only  if 

(3.12)  (IfB>  11^  ^fj  1  .  E«B(f  j2  OB  P  for  i«J. 

M  N  o» 

We  bow  see  due  djf^  •  •••  •  d#fB  os  0  and  hence,  since  each  f^ 
la  a  fnaecion  of  «a  aloaa,  lc  follows  due 

(3.13)  dJfl"  .....  dBfB"  •  2r, 
where  ret. 

If  HJ,  then  fa  is  a  linear  function  of  u^  aad  it  follows  froa 
(3.12)  that  a(u)«l«g,u>4s]2  on  U,  where  scR  and  qcRn  which  satisfies 
the  coadlcloa  gives  1b  the  statMaat  of  the  thaorea. 

Assubb  bow  that  rdO  aad  note  froa  (3.13)  that,  for  each 
a«l,*.*,B,  there  exist  s  .ccX  such  thae 


(3.10)  j  (2-j V«b*. 


(3.11) 


^U-i* 


IdaCd^-t,)2**,!  on  U. 

(3.12)  that  I  »B“0  and  hence  we  see  that 
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a(u)  -  r2II«  (u -t  )2]2  on  6. 

_  ■  B  ■ 


If  a<0  on  0,  than  |a|>0  on  U  mad  tho  preceding  proof  gives  the 
desired  result. 

la  concluding  this  section,  vs  rsasrk  that  since  the  top  order 
pert  of  P  in  (2.1)  transform  according  to  a  formula  which  is  in¬ 
dependent  of  the  lower  order  coefficients,  the  results  of  Sections 
2  and  3  also  hold  for  quasi- linear  operators. 


Section  A:  Miscellaneous  ranarka 

The  higher  order  cases  of  Problsm  1  end  2  are  auch  more 
difficult.  In  the  case  r-3  we  see  that  if  P  Is  a  third  order 
linear  partial  differential  operator  on  an  open  subset  C  of  RB, 
■ay. 

p "  i.J.k  *«k  D«k  + **'  **  °* 

trie  la  lta  Indices,  and  If 


with  a^s  0*ft  snooth  and 


F  "  £  fcl»,/hta+,"SSB 

t.n.P  ^ 

for  a  coordinate  frene  O/Jx*}  for  T2(0)  with  b^  syonetrlc  in 


its  Indices,  then 


btt*  ■  *  k  *ljk  Vt  ®3X.  Vp  SB.  0  18L  *.* 

A s  in  the  case  r*2f  vt  now  sad  that  if 


A  ‘  i  j.k  *«k  5*5k  • 


i 
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is  the  symmetric  cubic  fora  sssocistsd  co  P,  Chen  Problem  1 
[respectively,  Problem  2]  for  r-3  can  be  stated  u  finding  necessary 
sad  sufficient  conditions  for  tha  existence  of  coordinates  x  on  U 
[respectively,  e  eaooch  cap  C:  U-Ct(n)  ]  such  that 

Bx  A  -  B  [respectively,  C  A  ■  B)  on  U, 

where  1  is  s  reel  sysmatrie  cubic  fora  sad  the  operation  on  the 
left  is  defined  po in twice  as  the  usual  action  of  CL(n)  on  the 
space  of  real  symmetric  cubic  forma.  We  therefore  have  the 
obvious  necessary  condition  on  P  chat  the  real  symeerlc  cubic 
forme  A(u)  for  each  ucU  belong  eo  the  seas  orbit  of  this  action  of 
GL(n) .  Kota  that  this  condition  is  vary  strong,  since  it  is  well- 
known  chat  there  are  an  infinite  number  of  such  orbits.  We 
suspect,  however,  chat  if  V  is  smoothly  contractible,  then  this 
constancy  of  orbit  condition  is  necessary  sad  sufficient  for 
solving  Problem  2.  as  in  the  case  r-2,  a  solution  to  Problem  1  is 
provided  by  a  solution  G  to  Problem  2  which  satisfies  the  usual 
incegr ability  conditions.  Transferring  these  incegr  ability  con¬ 
ditions  to  conditions  on  A  in  certain  "generic”  cases  will  in¬ 
volve  the  vanishing  of  some  type  of  "higher  order"  curvature. 

The  preceding  remarks  carry  ever  in  the  obvious  way  to  the 
cases  r>3.  We  remark,  however,  chat  the  general  problen  of  finding 
invariants  which  determine  the  orbits  of  the  action  of  CL(a)  on 
the  space  of  real  symmetric  r-llnear  forms  is  still  open. 

Finally,  we  remark  chat  one  can  consider  the  problems  of 
(total)  reduction  to  constant  coefficients  by  changing  "|a|-r"  to 
”0«|ai<r"  end  deleting  "+•••"  in  Problem  1  end  2.  In  [ 1) ,  Cotton 
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considers  this  version  of  Problem  1  in  the  case  r»2.  To  illustrate 
the  form  of  these  new  problems,  one  can  proceed  in  the  case  r»2 
as  in  the  beginning  of  Seccion  2  Co  obcain 


Problem  1' :  Find  necessary  and  sufficient  conditions  on  P  in  (2.1) 
and  U  for  the  existence  of  coordinates  x  on  U  and  b^.b^cR  for 
which 


(4.1) 


ki 


t  a^  DiXj[  D^xt  on  U  for  k,i“lf***,n,  and 
1»  j 


*  *ij  Dij*k  +  J  *i  Di*k  SS.  V  lor  k“l, •  ••  ,n. 

*>»  J  * 


Problem  V  :  Find  necessary  and  sufficient  conditions  on  P  in  (2.1) 
and  U  for  the  existence  of  smooth  maps  g^  ,g^k:  with 

8ijk  *  8ikj  iSi  dat(g^)#  0  on  U  and  bk.*>kitR  for  which 


and 


bki  “  *  *ij  «ki  *ij  £2  C  *21 

(4.2)  bk  -  E  a^  gklj  +  1  «t  Rfci  SS.  o  lSL  k-l,***,n. 
it  J  i 


There  arc  several  ways  to  approach  these  problems.  The 
approach  to  Problem  2'  is  to  find  a  solution  G»(g^):  to  Prob¬ 

lem  2 (r"2)  and  then  solve  the  system  of  equations  (4.2)'  for  the 
unknown  functions  Since  this  is  now  viewed  as  a  system  of 

n  equations  in  n(  2*)  unknowns,  we  expect  that  solutions  exist  to 
Problem  2'  under  very  mild  conditions.  The  first  approach  to 
Problem  1'  is  to  seek  solutions  to  Problem  2'  which  satisfy  the 
integrabillcy  conditions 


W<v\ 
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The  second  approach  to  Problem  I’  la  to  find  coordinates  x  on  0 
"hich  solve  Problem  1  (r-2)  and  check  to  see  If  the?  satisfy  (4.1). 
Sots,  hwwnrer,  that  the  b^'s  are  uniquely  determined  by  P  and  the 
coordinates  x,  and  hence  a  study  of  Problem  V  under  this  approach 
would  involve  a  study  of  systems  of  coordinates  which  preserve 
constant  coefficients  in  the  top  order  part. 

In  cases  where  r>2,  there  la  one  exception  to  the  preceding 
renarks.  This  Is  caused  by  the  fact  that,  although  the  numbers  of 
equations  and  unknowns  la  the  ays ten  analogous  to  (4.2)  increase 
as  r  increases,  they  do  It  In  a  sort  of  Inverse  way  according  to 
levels.  Thus  the  subsystem  corresponding  to  the  part  of  order 
r-1  would  be  viewed  as  s  system  of  (“Jlj2)  equations  la  n(“Jl) 
unknowns.  Thus  we  are  lad  to  the  conclusion  that  there  will  be 
much  stronger  conditions  for  the  existence  of  solutions  to 
Problem  2*. 
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